<210> 1

<211> 1197 

<212> DNA 

<213> Pseudomonas putida 

<400> 1 



atgcacggct ccaacaagct cccaggattt gccacccgcg ccattcacca tggctacgac 60
ccccaggacc acggcggcgc actggtgcca ccggtctacc agaccgcgac gttcaccttc 120
cccaccgtgg aatacggcgc tgcgtgcttt gccggcgagc aggccgggca tttctacagc 180
cgcatctcca accccaccct caacctgctg gaagcacgca tggcctcgct ggaaggcggc 240
gaggccgggc tggcgctggc ctcgggcatg ggggcgatca cgtccacgct atggacactg 300
ctgcgccccg gtgacgaggt gctgctgggc aacaccctgt acggctgcac ctttgccttc 360
ctgcaccacg gcatcggcga gttcggggtc aagctgcgcc atgtggacat ggccgacctg 420
caggcactgg aggcggccat gacgccggcc acccgggtga tctatttcga gtcgccggcc 480
aaccccaaca tgcacatggc cgatatcgcc ggcgtggcga agattgcacg caagcacggc 540
gcgaccgtgg tggtcgacaa cacctactgc acgccgtacc tgcaacggcc actggagctg 600
ggcgccgacc tggtggtgca ttcggccacc aagtacctga gcggccatgg cgacatcact 660
gctggcattg tggtgggcag ccaggcactg gtggaccgta tacgtctgca gggcctcaag 720
gacatgaccg gtgcggtgct ctcgccccat gacgccgcac tgttgatgcg cggcatcaag 780
accctcaacc tgcgcatgga ccgccactgc gccaacgctc aggtgctggc cgagttcctc 840
gcccggcagc cgcaggtgga gctgatccat tacccgggcc tggcgagctt cccgcagtac 900
accctggccc gccagcagat gagccagccg ggcggcatga tcgccttcga actcaagggc 960





ggcatcggtg ccgggcggcg gttcatgaac gccctgcaac tgttcagccg cgcggtgagc 1020
ctgggcgatg ccgagtcgct ggcgcagcac ccggcaagca tgactcattc cagctatacc 1080
ccagaggagc gtgcgcatta cggcatctcc gaggggctgg tgcggttgtc ggtggggctg 1140
gaagacatcg acgacctgct ggccgatgtg caacaggcac tcaaggcgag tgcctga 1197

<210> 2
<211> 399
<212> PRT
<213> Artificial Sequence
<220>
<223> Pseudomonas putida methionine gamma-lyase amino acid sequence,
with a non-naturally occurring glycine residue inserted at position 2

<400> 2



Met Gly His Gly Ser Asn Lys Leu Pro Gly Phe Ala Thr Arg Ala Ile
1 5 10 15

His His Gly Tyr Asp Pro Gln Asp His Gly Gly Ala Leu Val Pro Pro
20 25 30

Val Tyr Gln Thr Ala Thr Phe Thr Phe Pro Thr Val Glu Tyr Gly Ala
35 40 45

Ala Cys Phe Ala Gly Glu Gln Ala Gly His Pro Tyr Ser Arg Ile Ser
50 55 60

Asn Pro Thr Leu Asn Leu Leu Gln Ala Arg Met Ala Ser Leu Glu Gly
65 70 75 80

Gly Glu Ala Gly Leu Ala Leu Ala Ser Gly Met Gly Ala Ile Thr Ser
85 90 95

Thr Leu Tyr Thr Leu Leu Arg Pro Gly Asp Glu Val Leu Leu Gly Asn
100 105 110

Thr Leu Tyr Gly Cys Thr Phe Ala Phe Leu His His Gly Ile Gly Glu
115 120 125

Phe Gly Val Lys Leu Arg His Val Asp Met Ala Asp Leu Gln Ala Leu
130 135 140

Glu Ala Ala Met Thr Pro Ala Thr Arg Val Ile Tyr Phe Glu Ser Pro
145 150 155 160

Ala Asn Pro Asn Met His Met Ala Asp Ile Ala Gly Val Ala Lys Ile
165 170 175

Ala Arg Lys His Gly Ala Thr Val Val Val Asp Asn Thr Tyr Cys Thr
180 185 190

Pro Tyr Leu Gln Arg Pro Leu Gln Leu Gly Ala Asp Leu Val Val His
195 200 205

Ser Ala Thr Lys Tyr Leu Ser Gly His Gly Asp Ile Thr Ala Gly Ile
210 215 220

Val Val Gly Ser Gln Ala Leu Val Asp Arg Ile Arg Leu Gln Gly Leu
225 230 235 240

Lys Asp Met Thr Gly Ala Val Leu Ser Pro His Asp Ala Ala Leu Leu
245 250 255

Met Arg Gly Ile Lys Thr Leu Asn Leu Arg Met Asp Arg His Cys Ala
260 265 270

Asn Ala Gln Val Leu Ala Glu Phe Leu Ala Arg Gln Pro Gln Val Glu
275 280 285

Leu Ile His Tyr Pro Gly Leu Ala Ser Phe Pro Gln Tyr Thr Leu Ala
290 295 300

Arg Gln Gln Met Ser Gln Pro Gly Gly Met Ile Ala Phe Glu Leu Lys
305 310 315 320

Gly Gly Ile Gly Ala Gly Arg Arg Phe Met Asn Ala Leu Gln Leu Phe
325 330 335

Ser Arg Ala Val Ser Leu Gly Asp Ala Glu Ser Leu Ala Gln His Pro
340 345 350

Ala Ser Met Thr His Ser Ser Tyr Thr Pro Glu Glu Arg Ala His Tyr
355 360 365

Gly Ile Ser Glu Gly Leu Val Arg Leu Ser Val Gly Leu Glu Asp Ile
370 375 380

Asp Asp Leu Leu Ala Asp Val Gln Gln Ala Leu Lys Ala Ser Ala
385 390 395

<210> 3 
<211> 1202 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Pseudomonas putida methionine gamma-lyase sequence, with glycine
codon inserted to incorporate restriction site and numerous
naturally occurring codons changed to codons more commonly used in
enteric bacteria

<400> 3
catgggtcac ggctccaaca aactgccggg ctttgctacc cgcgctatcc accacggtta 60
tgacccgcag gatcacggtg gtgcactggt tccgccggtt taccagactg ctactttcac 120
cttcccgacc gttgaatacg gcgctgcgtg ctttgctggc gaacaggctg gtcacttcta 180
ctcccgtatc tccaacccga ccctgaacct gctggaagca cgtatggcat ctctggaagg 240
cggcgaagct ggtctggcgc tggcatctgg tatgggcgcg atcacctcta ccctgtggac 300
cctgctgcgt ccgggtgacg aagttctgct gggcaacacc ctgtatggtt gtacttttgc 360
tttcctgcac cacggtatcg gtgaattcgg cgttaaactg cgtcacgtag atatggctga 420
cctgcaggca ctggaagcgg ctatgacccc ggctacccgt gttatctact tcgaatcccc 480
ggctaacccg aacatgcaca tggctgacat cgcaggtgtt gctaaaatcg ctcgtaagca 540
cggcgctacc gtagttgttg ataacaccta ctgtactccg tacctgcaac gtccgctgga 600
actgggcgct gacctggttg ttcactccgc tactaaatac ctgtccggcc acggcgacat 660
cactgctggc atcgtagtag gctcccaggc actggttgac cgtatccgtc tgcaaggtct 720
gaaagacatg accggcgctg ttctgtcccc gcacgacgca gcactgctga tgcgtggtat 780
caagaccctg aacctgcgta tggaccgtca ctgtgctaac gctcaggtac tggctgaatt 840
cctggctcgt cagccgcagg tagaactgat ccactatccg ggcctggctt ccttcccgca 900
gtacactctg gcacgtcagc agatgtccca gccgggcggt atgatcgctt tcgaactgaa 960
gggtggcatc ggcgctggtc gtcgtttcat gaacgctctg cagctgttct cccgtgcggt 1020
ttccctgggt gacgctgaat ccctggcgca gcacccggca tccatgactc actcctccta 1080
cactccggaa gaacgtgcgc actacggcat ctccgaaggc ctggttcgtc tgtctgttgg 1140
tctggaagac atcgatgatc tgctggcaga cgttcagcag gctctgaagg ctagcgcttg 1200
ag 1202



<210> 4 
<211> 426 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Cloning fragment of SEQ ID NO. 3 
<400> 4 

catgggtcac ggctccaaca aactgccggg ctttgctacc cgcgctatcc accacggtta 60 
tgacccgcag gatcacggtg gtgcactggt tccgccggtt taccagactg ctactttcac 120 
cttcccgacc gttgaatacg gcgctgcgtg ctttgctggc gaacaggctg gtcacttcta 180 
ctcccgtatc tccaacccga ccctgaacct gctggaagca cgtatggcat ctctggaagg 240 
cggcgaagct ggtctggcgc tggcatctgg tatgggcgcg atcacctcta ccctgtggac 300
cctgctgcgt ccgggtgacg aagttctgct gggcaacacc ctgtatggtt gtacttttgc 360 
tttcctgcac cacggtatcg gtgaattcgg cgttaaactg cgtcacgtag atatggctga 420 
cctgca 426 

<210> 5 
<211> 441 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Cloning fragment of SEQ ID NO. 3 
<400> 5 

caagaggcca tgggtcacgg ctccaacaaa ctgccgggct ttgctacccg cgctatccac 60 
cacggttatg acccgcagga tcacggtggt gcactggttc cgccggttta ccagactgct 120 
actttcacct tcccgaccgt tgaatacggc gctgcgtgct ttgctggcga acaggctggt 180 
cacttctact cccgtatctc caacccgacc ctgaacctgc tggaagcacg tatggcatct 240 
ctggaaggcg gcgaagctgg tctggcgctg gcatctggta tgggcgcgat cacctctacc 300 
ctgtggaccc tgctgcgtcc gggtgacgaa gttctgctgg gcaacaccct gtatggttgt 360 
acttttgctt tcctgcacca cggtatcggt gaattcggcg ttaaactgcg tcacgtagat 420
atggctgacc tgcaggcact g 441 



<210> 6 

<211> 410 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Cloning fragment of SEQ ID NO. 3 

<400> 6 



ggcactggaa gcggctatga ccccggctac ccgtgttatc tacttcgaat ccccggctaa 60
cccgaacatg cacatggctg acatcgcagg tgttgctaaa atcgctcgta agcacggcgc 120
taccgtagtt gttgataaca cctactgtac tccgtacctg caacgtccgc tggaactggg 180
cgctgacctg gttgttcact ccgctactaa atacctgtcc ggccacggcg acatcactgc 240
tggcatcgta gtaggctccc aggcactggt tgaccgtatc cgtctgcaag gtctgaaaga 300
catgaccggc gctgttctgt ccccgcacga cgcagcactg ctgatgcgtg gtatcaagac 360
cctgaacctg cgtatggacc gtcactgtgc taacgctcag gtactggctg 410



<210> 7 

<211> 430 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Cloning fragment of SEQ ID NO. 3 

<400> 7 



gctgacctgc aggcactgga agcggctatg accccggcta cccgtgttat ctacttcgaa 60
tccccggcta acccgaacat gcacatggct gacatcgcag gtgttgctaa aatcgctcgt 120
aagcacggcg ctaccgtagt tgttgataac acctactgta ctccgtacct gcaacgtccg 180
ctggaactgg gcgctgacct ggttgttcac tccgctacta aatacctgtc cggccacggc 240
gacatcactg ctggcatcgt agtaggctcc caggcactgg ttgaccgtat ccgtctgcaa 300
ggtctgaaag acatgaccgg cgctgttctg tccccgcacg acgcagcact gctgatgcgt 360
ggtatcaaga ccctgaacct gcgtatggac cgtcactgtg ctaacgctca ggtactggct 420
gaattcctgg 430



<210> 8 

<211> 366 

<212> DNA 

<213> Artificial sequence 



<220> 



<223> Cloning fragment of SEQ ID NO. 3 
<400> 8 

aattcctggc tcgtcagccg caggtagaac tgatccacta tccgggcctg gcttccttcc 60 

cgcagtacac tctggcacgt cagcagatgt cccagccggg cggtatgatc gctttcgaac 120 

tgaagggtgg catcggcgct ggtcgtcgtt tcatgaacgc tctgcagctg ttctcccgtg 180 

cggtttccct gggtgacgct gaatccctgg cgcagcaccc ggcatccatg actcactcct 240 

cctacactcc ggaagaacgt gcgcactacg gcatctccga aggcctggtt cgtctgtctg 300 

ttggtctgga agacatcgat gatctgctgg cagacgttca gcaggctctg aaggctagcg 360 

cttgag 366 

<210> 9 
<211> 383 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Cloning fragment of SEQ ID NO. 3
<400> 9 

tcttaatgaa ttcctggctc gtcagccgca ggtagaactg atccactatc cgggcctggc 60 

ttccttcccg cagtacactc tggcacgtca gcagatgtcc cagccgggcg gtatgatcgc 120 

tttcgaactg aagggtggca tcggcgctgg tcgtcgtttc atgaacgctc tgcagctgtt 180 

ctcccgtgcg gtttccctgg gtgacgctga atccctggcg cagcacccgg catccatgac 240 

tcactcctcc tacactccgg aagaacgtgc gcactacggc atctccgaag gcctggttcg 300

tctgtctgtt ggtctggaag acatcgatga tctgctggca gacgttcagc aggctctgaa 360

ggctagcgct tgaggatcca cca 383



<210> 10 

<211> 33 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer/template derived from SEQ ID NO. 3

<400> 10 

caagaggcca tgggtcacgg ctccaacaaa ctg 33



<210> 11 
<211> 114 
<212> DNA 



<213> Artificial Sequence 



<220> 

<223> Primer/template derived from SEQ ID NO. 3
<400> 11

cacggctcca acaaactgcc gggctttgct acccgcgcta tccaccacgg ttatgacccg 60
caggatcacg gtggtgcact ggttccgccg gtttaccaga ctgctacttt cacc 114



<210> 12 
<211> 116 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer/template derived from SEQ ID NO. 3
<400> 12

gcttccagca ggttcagggt cgggttggag atacgggagt agaagtgacc agcctgttcg 60
ccagcaaagc acgcagcgcc gtattcaacg gtcgggaagg tgaaagtagc agtctg 116



<210> 13 
<211> 117 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer/template derived from SEQ ID NO. 3
<400> 13

ctgaacctgc tggaagcacg tatggcatct ctggaaggcg gcgaagctgg tctggcgctg 60
gcatctggta tgggcgcgat cacctctacc ctgtggaccc tgctgcgtcc gggtgac 117



<210> 14 
<211> 116 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer/template derived from SEQ ID NO. 3 
<400> 14 

gccatatcta cgtgacgcag tttaacgccg aattcaccga taccgtggtg caggaaagca 60
aaagtacaac catacagggt gttgcccagc agaacttcgt cacccggacg cagcag 116



<210> 15 

<211> 33 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Primer/template derived from SEQ ID NO. 3
<400> 15 

cagtgcctgc aggtcagcca tatctacgtg acg 33 



<210> 16 

<211> 33 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer/template derived from SEQ ID NO. 3

<400> 16 

gctgacctgc aggcactgga agcggctatg acc 33 



<210> 17 
<211> 114 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer/template derived from SEQ ID NO. 3 
<400> 17 

ctggaggctg ctatgacccc ggctacccgt gttatctact tcgaatcccc ggctaacccg 60 
aacatgcaca tggctgacat cgcaggtgtt gctaaaatcg ctcgtaagca cggc 114 



<210> 18 
<211> 115 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer/template derived from SEQ ID NO. 3
<400> 18 

ggtatttagt agcggagtga acaaccaggt cagcgcccag ttccagcgga cgttgcaggt 60 
acggagtaca gtaggtgtta tcaacaacta cggtagcgcc gtgcttacga gcgat 115 



<210> 19 

<211> 111 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer/template derived from SEQ ID NO. 3

<400> 19

cactccgcta ctaaatacct gtccggccac ggcgacatca ctgctggcat cgtagtaggc 60
tcccaggcac tggttgaccg tatccgtctg caaggtctga aagacatgac c 111



<210> 20 
<211> 115 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer/template derived from SEQ ID NO. 3
<400> 20 

gtacctgagc gttagcacag tgacggtcca tacgcaggtt cagggtcttg ataccacgca 60 
tcagcagtgc tgcgtcgtgc ggggacagaa cagcgccggt catgtctttc agacc 115 



<210> 21
<211> 33
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer/template derived from SEQ ID NO. 3
<400> 21

ccaggaattc agccagtacc tgagcgttag cac 33

<210> 22
<211> 31
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer/template derived from SEQ ID NO. 3
<400> 22

tcttaatgaa ttcctggctc gtcagccgca g 31

<210> 23
<211> 105
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer/template derived from SEQ ID NO. 3
<400> 23

ctggctcgtc agccgcaggt agaactgatc cactatccgg gcctggcttc cttcccgcag 60
tacactctgg cacgtcagca gatgtcccag ccgggcggta tgatc 105




<210> 24 
<211> 106 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer/template derived from SEQ ID NO. 3
<400> 24 

cgtcacccag ggaaaccgca cgggagaaca gctgcagagc gttcatgaaa cgacgaccag 60 
cgccgatgcc acccttcagt tcgaaagcga tcatgccacc cggctg 106 



<210> 25
<211> 106
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer/template derived from SEQ ID NO. 3
<400> 25

gcggtttccc tgggtgacgc tgaatccctg gcgcagcacc cggcatccat gactcactcc 60
tcctacactc cggaagaacg tgcgcactac ggcatctccg aaggcc 106

<210> 26
<211> 98
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer/template derived from SEQ ID NO. 3
<400> 26

caagcgctag ccttcagagc ctgctgaacg tctgccagca gatcatcgat gtcttccaga 60
ccaacagaca gacgaaccag gccttcggag atgccgta 98

<210> 27
<211> 32
<212> DNA
<213> Artificial Sequence
<220>
<223> Primer/template derived from SEQ ID NO. 3
<400> 27

tggtggatcc tcaagcgcta gccttcagag cc 32



<210> 28 
<211> 1197 
<212> DNA 



<213> Artificial Sequence 
<220> 

<223> Methionine gamma-lyase gene derived from Pseudomonas putida,
having all arginine codons replaced with arginine codons found most
commonly in E. coli

<400> 28
atgcacggct ccaacaagct cccaggattt gccacccgcg ccattcacca tggctacgac 60
ccccaggacc acggcggcgc actggtgcca ccggtctacc agaccgcgac gttcaccttc 120
cccaccgtgg aatacggcgc tgcgtgcttt gccggcgagc aggccgggca tttctacagc 180
cgcatctcca accccaccct caacctgctg gaagcacgca tggcctcgct ggaaggcggc 240
gaggccgggc tggcgctggc ctcgggcatg ggggcgatca cgtccacgct atggacactg 300
ctgcgccccg gtgacgaggt gctgctgggc aacaccctgt acggctgcac ctttgccttc 360
ctgcaccacg gcatcggcga gttcggggtc aagctgcgcc atgtggacat ggccgacctg 420
caggcactgg aggcggccat gacgccggcc acccgtgtga tctatttcga gtcgccggcc 480
aaccccaaca tgcacatggc cgatatcgcc ggcgtggcga agattgcacg caagcacggc 540
gcgaccgtgg tggtcgacaa cacctactgc acgccgtacc tgcaacgtcc actggagctg 600
ggcgccgacc tggtggtgca ttcggccacc aagtacctga gcggccatgg cgacatcact 660
gctggcattg tggtgggcag ccaggcactg gtggaccgta tacgtctgca gggcctcaag 720


gacatgaccg gtgcggtgct ctcgccccat gacgccgcac tgttgatgcg cggcatcaag 780


accctcaacc tgcgcatgga ccgccactgc gccaacgctc aggtgctggc cgagttcctc 840
gcccgtcagc cgcaggtgga gctgatccat tacccgggcc tggcgagctt cccgcagtac 900
accctggccc gccagcagat gagccagccg ggcggcatga tcgccttcga actcaagggc 960
ggcatcggtg ccgggcgtcg tttcatgaac gccctgcaac tgttcagccg cgcggtgagc 1020
ctgggcgatg ccgagtcgct ggcgcagcac ccggcaagca tgactcattc cagctatacc 1080
ccagaggagc gtgcgcatta cggcatctcc gaggggctgg tgcgtttgtc ggtggggctg 1140
gaagacatcg acgacctgct ggccgatgtg caacaggcac tcaaggcgag tgcctga 1197



<210> 29 

<211> 1197 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Methionine gamma-lyase gene derived from Pseudomonas putida
having all rare arginine, leucine, isoleucine, and proline codons
replaced with respective corresponding codons found most commonly in
E. coli

<400> 29
atgcacggct ccaacaagct cccaggattt gccacccgcg ccattcacca tggctacgac 60
ccgcaggacc acggcggcgc actggtgcca ccggtctacc agaccgcgac gttcaccttc 120
ccgaccgtgg aatacggcgc tgcgtgcttt gccggcgagc aggccgggca tttctacagc 180
cgcatctcca acccgaccct caacctgctg gaagcacgca tggcctcgct ggaaggcggc 240
gaggccgggc tggcgctggc ctcgggcatg ggggcgatca cgtccacgct gtggacactg 300
ctgcgcccgg gtgacgaggt gctgctgggc aacaccctgt acggctgcac ctttgccttc 360
ctgcaccacg gcatcggcga gttcggggtc aagctgcgcc atgtggacat ggccgacctg 420
caggcactgg aggcggccat gacgccggcc acccgtgtga tctatttcga gtcgccggcc 480
aacccgaaca tgcacatggc cgatatcgcc ggcgtggcga agattgcacg caagcacggc 540
gcgaccgtgg tggtcgacaa cacctactgc acgccgtacc tgcaacgtcc actggagctg 600
ggcgccgacc tggtggtgca ttcggccacc aagtacctga gcggccatgg cgacatcact 660
gctggcattg tggtgggcag ccaggcactg gtggaccgta tccgtctgca gggcctcaag 720
gacatgaccg gtgcggtgct ctcgccgcat gacgccgcac tgttgatgcg cggcatcaag 780
accctcaacc tgcgcatgga ccgccactgc gccaacgctc aggtgctggc cgagttcctc 840
gcccgtcagc cgcaggtgga gctgatccat tacccgggcc tggcgagctt cccgcagtac 900
accctggccc gccagcagat gagccagccg ggcggcatga tcgccttcga actcaagggc 960
ggcatcggtg ccgggcgtcg tttcatgaac gccctgcaac tgttcagccg cgcggtgagc 1020
ctgggcgatg ccgagtcgct ggcgcagcac ccggcaagca tgactcattc cagctatacc 1080
ccagaggagc gtgcgcatta cggcatctcc gaggggctgg tgcgtttgtc ggtggggctg 1140
gaagacatcg acgacctgct ggccgatgtg caacaggcac tcaaggcgag tgcctga 1197



<210> 30 
<211> 1200 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Methionine gamma-lyase gene derived from Pseudomonas putida
having all codons replaced with respective corresponding codons found
most commonly in E. coli

<400> 30

atgggtcacg gctccaacaa actgccgggt tttgctaccc gtgctatcca ccacggctac 60
gacccgcagg accacggcgg cgcactggtt ccgccggttt accagaccgc gaccttcacc 120
ttcccgaccg ttgaatacgg cgctgcgtgc tttgctggcg aacaggctgg tcacttctac 180
tcccgtatct ccaacccgac cctgaacctg ctggaagcac gtatggcttc cctggaaggc 240
ggcgaagctg gtctggcgct ggcttccggc atgggtgcga tcacctccac cctgtggacc 300
ctgctgcgtc cgggtgacga agttctgctg ggcaacaccc tgtacggctg cacctttgct 360
ttcctgcacc acggcatcgg cgaattcggt gttaagctgc gtcacgttga catggctgac 420
ctgcaggcac tggaagcggc tatgaccccg gctacccgtg ttatctactt cgaatccccg 480
gctaacccga acatgcacat ggctgaaatc gctggcgttg cgaagatcgc acgtaagcac 540
ggcgcgaccg ttgttgttga caacacctac tgcaccccgt acctgcaacg tccgctggaa 600
ctgggcgctg acctggttgt tcactccgct accaagtacc tgtccggcca cggcgacatc 660
actgctggca tcgttgttgg ctcccaggca ctggttgacc gtatccgtct gcaaggcctg 720
aaggacatga ccggtgcggt tctgtccccg cacgacgctg cactgctgat gcgtggcatc 780
aagaccctga acctgcgtat ggaccgtcac tgcgctaacg ctcaggttct ggctgaattc 840
ctggctcgtc agccgcaggt tgaactgatc cactacccgg gcctggcgtc cttcccgcag 900
tacaccctgg ctcgtcagca gatgtcccag ccgggcggca tgatcgcttt cgaactgaag 960
ggcggcatcg gtgctggtcg tcgtttcatg aacgctctgc agctgttctc ccgtgcggtt 1020
tccctgggcg aagctgaatc cctggcgcag cacccggcat ccatgactca ctcctcctac 1080
accccggaag aacgtgcgca ctacggcatc tccgaaggtc tggttcgtct gtccgttggt 1140


CtyyadgdCd 


i* pna pna npi" 


y i— Lyy u cia 


m l y y y 


rartaaacrac 

V** d \* w ^-j t*. t*. 


gagtgc t tga 


1200 


<210> 31 
<211> 972 
<212> DNA 

<213> Sporidiobolus salmonicolor 








<400> 31 
atggtcggca ctactaccct caacactggc gcttccctcg agctcgtcgg ctacggcacg 60
tggcaggcag caccgggcga ggtgggccag ggcgtcaagg tcgccatcga gactggatac 120
cgtcacctcg accttgccaa ggtctactcg aaccaacctg aggttggtgc cgccatcaag 180
gaggctggcg tcaagcgcga ggacctcttc atcacctcga agctctggaa caactcgcac 240
cgcccggagc aggtcgagcc tgcccttgac gacaccctca aggagctcgg cctcgagtac 300
ctcgaccttt acctcattca ctggcccgtc gcgttcccgc ccgagggcga catcacccag 360
aacctcttcc cgaaggccaa cgacaaggag gtcaagctcg acctggaggt cagcctcgtc 420
gacacgtgga aggcgatggt caagcttctc gacactggca aggtcaaggc gatcggcgtt 480
tccaacttcg acgcgaagat ggtcgacgcc atcatcgagg ctaccggcgt gaccccctcc 540
gtcaaccaga tcgagcgtca ccctctcctt ctccagcccg agctcatcgc ccaccacaag 600
gccaagaaca ttcacattac cgcatactct cctctcggta acaacaccgt cggcgcgcct 660
cttcttgtcc agcacccgga gatcaagcgc atcgccgaga agaacggctg cacgcccgct 720
caggtcctca ttgcctgggc catcgttggc ggccactcgg ttatccccaa gtcggtcacc 780
ccctcccgca ttggcgagaa cttcaagcag gtctcgctct cgcaggagga cgtcgatgcc 840
gtcagcaagc tcggcgaggg ttcgggccgc aggcgctaca acatcccctg cacgtactcg 900
cccaagtggg acatcaacgt ctttggcgag gaggacgaga agtcgtgcaa gaacgccgtg 960
aagatcaagt ag 972

<210> 32 
<211> 322 
<212> PRT 

<213> Sporidiobolus salmonicolor 
<400> 32 

Met Val Gly Thr Thr Thr Leu Asn Thr Gly Ala Ser Leu Glu Leu Val
1 5 10 15

Gly Tyr Gly Thr Trp Gln Ala Ala Pro Gly Glu Val Gly Gln Gly Val
20 25 30

Lys Val Ala Ile Glu Thr Gly Tyr Arg His Leu Asp Leu Ala Lys Val
35 40 45

Tyr Ser Asn Gln Pro Glu Val Gly Ala Ala Ile Lys Glu Ala Gly Val
50 55 60

Lys Arg Glu Asp Leu Phe Ile Thr Ser Lys Leu Trp Asn Asn Ser His
65 70 75 80

Arg Pro Glu Gln Val Glu Pro Ala Leu Asp Asp Thr Leu Lys Glu Leu
85 90 95

Gly Leu Glu Tyr Leu Asp Leu Tyr Leu Ile Trp Pro Val Ala Phe Pro
100 105 110

Pro Glu Gly Asp Ile Thr Gln Asn Leu Phe Pro Lys Ala Asn Asp Lys
115 120 125

Glu Val Lys Leu Asp Leu Glu Val Ser Leu Val Asp Thr Trp Lys Ala
130 135 140

Met Val Lys Leu Leu Asp Thr Gly Lys Val Lys Ala Ile Gly Val Ser
145 150 155 160

Asn Phe Asp Ala Lys Met Val Asp Ala Ile Ile Glu Ala Thr Gly Val
165 170 175

Thr Pro Ser Val Asn Gln Ile Glu Arg His Pro Leu Leu Leu Gln Pro
180 185 190

Glu Leu Ile Ala His His Lys Ala Lys Asn Ile His Ile Thr Ala Tyr
195 200 205

Ser Pro Leu Gly Asn Asn Thr Val Gly Ala Pro Leu Leu Val Gln His
210 215 220

Pro Glu Ile Lys Arg Ile Ala Glu Lys Asn Gly Cys Thr Pro Ala Gln
225 230 235 240

Val Leu Ile Ala Trp Ala Ile Val Gly Gly His Ser Val Ile Pro Lys
245 250 255

Ser Val Thr Pro Ser Arg Ile Gly Glu Asn Phe Lys Gln Val Ser Leu
260 265 270

Ser Gln Glu Asp Val Asp Ala Val Ser Lys Leu Gly Glu Gly Ser Gly
275 280 285

Arg Arg Arg Tyr Asn Ile Pro Cys Thr Tyr Ser Pro Lys Trp Asp Ile
290 295 300

Asn Val Phe Gly Glu Glu Asp Glu Lys Ser Cys Lys Asn Ala Val Lys
305 310 315 320

Ile Lys



<210> 33 
<211> 972 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic gene derived from Sporidiobolus salmonicolor
NADPH-Dependent Aldehyde Reductase 1, having numerous codons replaced
with others encoding the same amino acids to reduce the free energy
of folding

<400> 33
atggttggta ctactactct gaacactggt gcatctctgg aactggtagg ttatggtact 60
tggcaagctg ctccgggcga agtaggtcaa ggtgtaaaag tagctatcga aactggttat 120
cgtcatctgg atctggcaaa agtatactct aaccagccgg aagtaggtgc agcaatcaag 180
gaagctggcg ttaaacgtga ggatctgttt atcacttcta aactgtggaa caactcccac 240
cgtccggaac aggtagaacc ggctctggat gatactctga aagaactggg cctggagtat 300
ctggacctgt acctgatcca ctggccggta gcatttccgc cggaaggtga tatcactcag 360
aacctgttcc cgaaagctaa cgataaagaa gtaaaactgg acctggaagt ttctctggta 420
gacacttgga aagcaatggt aaaactgctg gatactggta aagttaaagc tatcggtgtt 480
tccaactttg acgcaaaaat ggttgacgct atcatcgaag caactggcgt aactccgtct 540
gttaaccaga tcgaacgtca cccgctgctg ctgcagccag agctgatcgc acaccacaaa 600
gctaaaaaca tccacatcac cgcatactcc ccgctgggta acaacaccgt aggcgcaccg 660
ctgctggtac aacacccgga aatcaaacgt atcgctgaaa aaaacggctg tactccggct 720
caggtactga tcgcatgggc tatcgtaggt ggtcattctg ttatcccgaa atccgtaact 780
ccgtctcgta ttggcgaaaa cttcaaacag gtttctctgt ctcaggaaga tgttgatgct 840
gtttctaagc tgggcgaagg ttccggtcgt cgtcgttata acatcccgtg cacttattcc 900
ccgaagtggg atatcaacgt tttcggtgaa gaagatgaaa aatcctgtaa aaacgctgtt 960
aaaatcaaat aa 972


<210> 34 
<211> 1032 
<212> DNA 

<213> Sporidiobolus salmonicolor 








<400> 34 
atggccaaaa 


tcgacaacgc 


tgtgcttccc 


gagggctcgc 


tcgtgctcgt 


caccggcgcc 


60 


aacggcttcg 


tcgcttcgca 


cgtcgtcgaa 


cagctccttg 


aacacggtta 


caaggtccgt 


120 











ggtacggctc 


gtagugcexe 


^ ?s ^ /■« ^ t~tr* f~> 

CadaCLtgCC 


ddu u uy u ay a. 


ay uy v~ Lyyy a 


tgecaagtae 


180 


cccggtcgct 


tegagaegge 


cgtggccgag 


y du d uy u uua 


aauayyy _j ^ 


ttacaacaaa 

W ^ 


240 


gtgatcaagg 


gcgccgccgg 




d u u y u u uu-uy 


V»- l« V4 t» w w l» 


ctcaaacaaa 


300 


tacgacgagg 


ccgccacccc 


cgccaucyy a. 


yyLaLLL uua 


apart rtcca 


taccaccact 


360 


gccacgccct 


ctgtcaagcg 


c uucgt.ee cc 


aCCLut-LLya 


LLy u u u^ay ^ 


art* tatcccc 


420 


aagccgaatg 


tcgaggggat 


CCaCCCCgaC 


y dy ctciy u uu u 


ana a rpf PCf3 


aaacatcaac 

d *L*» » 


480 


aaggccaaga 


ctctccctga 


aagcy clu-l-l-l 


Lay d dy u uy u> 


t" r t~ acra t c t a 


caccacaaac 

w ^— * ~] *-* ^ 


540 


aagaccgagg 


eggagcttge 


cgc u uy y add 


u uuduyyduy 


aaaaraaacc 

ay aa^ »-» 


acacttcacc 


600 


ctcaacgccg 


tcctccccaa 


c uacacgau u 


LjyyaCydLLL 


LLyciLH*^ya 


aa ppcaatcc 


660 


ggctcgactt 


cgggccggac 


gauy cig uu- uu 


hfraal" fro c n 
u uuaauyy^-y 


aaat*ttcccc 


cgccctcgct 


720 


ctgatgcccc 


c t cagtac t a 


cgugucyyuc. 


y LLyaLdL u y 


CflT'tTT't'CIPa 
y L\< v_* *h \— c*. 


cctcaaatac 


780 


ttggttctgc 


cccagatcga 


gcgccgccgc 


y ll LaLy y ua 


uLy w^yyoaL 


at - *r raactaa 


840 


aacacggtcc 


t cgcgacgtt 


/-» /-i /t a a /-» /"i 'H /T 

CCyCaayuty 


LdLLLydyLa 


a rra pat" t" PPC 


ggecgact t c 


900 


cccgaccagg 


gccaggacct 


CCCCaay LLC 


ydLdLyyLLL 


ppfanrrtrna 

y y ^*^yci 


aat fctcaaa 


960 


agtx cgggca 


ggcccgggtg 


y dy y u uy a. u »^ 




tcaaggacct 


catcaqctcc 


1020 


gaaaccgett 


ga 










1032 



<210> 35 
<211> 343 
<212> PRT 

<213> Sporidiobolus salmonicolor 
<400> 35 

Met Ala Lys Ile Asp Asn Ala Val Leu Pro Glu Gly Ser Leu Val Leu
1 5 10 15

Val Thr Gly Ala Asn Gly Phe Val Ala Ser His Val Val Glu Gln Leu
20 25 30

Leu Glu His Gly Tyr Lys Val Arg Gly Thr Ala Arg Ser Ala Ser Lys
35 40 45

Leu Ala Asn Leu Gln Lys Arg Trp Asp Ala Lys Tyr Pro Gly Arg Phe
50 55 60

Glu Thr Ala Val Val Glu Asp Met Leu Lys Gln Gly Ala Tyr Asp Glu
65 70 75 80

Val Ile Lys Gly Ala Ala Gly Val Ala His Ile Ala Ser Val Val Ser
85 90 95

Phe Ser Asn Lys Tyr Asp Glu Val Val Thr Pro Ala Ile Gly Gly Thr
100 105 110

Leu Asn Ala Leu Arg Ala Ala Ala Ala Thr Pro Ser Val Lys Arg Phe
115 120 125

Val Leu Thr Ser Ser Thr Val Ser Ala Leu Ile Pro Lys Pro Asn Val
130 135 140

Glu Gly Ile Tyr Leu Asp Glu Lys Ser Trp Asn Leu Glu Ser Ile Asp
145 150 155 160

Lys Ala Lys Thr Leu Pro Glu Ser Asp Pro Gln Lys Ser Leu Trp Val
165 170 175

Tyr Ala Ala Ser Lys Thr Glu Ala Glu Leu Ala Ala Trp Lys Phe Met
180 185 190

Asp Glu Asn Lys Pro His Phe Thr Leu Asn Ala Val Leu Pro Asn Tyr
195 200 205

Thr Ile Gly Thr Ile Phe Asp Pro Glu Thr Gln Ser Gly Ser Thr Ser
210 215 220

Gly Trp Met Met Ser Leu Phe Asn Gly Glu Val Ser Pro Ala Leu Ala
225 230 235 240

Leu Met Pro Pro Gln Tyr Tyr Val Ser Ala Val Asp Ile Gly Leu Leu
245 250 255

His Leu Gly Cys Leu Val Leu Pro Gln Ile Glu Arg Arg Arg Val Tyr
260 265 270

Gly Thr Ala Gly Thr Phe Asp Trp Asn Thr Val Leu Ala Thr Phe Arg
275 280 285

Lys Leu Tyr Pro Ser Lys Thr Phe Pro Ala Asp Phe Pro Asp Gln Gly
290 295 300

Gln Asp Leu Ser Lys Phe Asp Thr Ala Pro Ser Leu Glu Ile Leu Lys
305 310 315 320

Ser Leu Gly Arg Pro Gly Trp Arg Ser Ile Glu Glu Ser Ile Lys Asp
325 330 335

Leu Val Gly Ser Glu Thr Ala
340



<210> 36 
<211> 1032 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic gene derived from Sporidiobolus salmonicolor
NADPH-Dependent Aldehyde Reductase 2, having numerous codons replaced
with others encoding the same amino acids to reduce the free energy
of folding, and an ala to gly mutation at amino acid position 2

<400> 36
atggctaaaa tcgataacgc agttctgccg gaaggttccc tggttctggt taccggtgct 60
aacggtttcg ttggttccca cgttgttgaa cagctgctgg aacacggtta caaagttcgt 120
ggtaccgctc gttccgcttc caaactggct aacctgcaga aacgttggga cgctaaatac 180
ccgggtcgtt tcgaaaccgc tgttgttgaa gacatgctga aacagggtgc ttacgacgaa 240
gttatcaaag gtgctgctgg tgttgctcac atcgcttccg ttgtttcctt ctccaacaaa 300
tacgacgaag ttgttacccc ggctatcggt ggtaccctga acgctctgcg tgctgctgct 360
gctaccccgt ccgttaaacg tttcgttctg acctcctcca ccgtttccgc tctgatcccg 420
aaaccgaacg ttgaaggtat ctacctggac gaaaaatcct ggaacctgga atccatcgac 480
aaagctaaaa ccctgccgga atccgacccg cagaaatccc tgtgggtata cgctgcatcc 540
aagaccgaag ctgaactggc tgcatggaaa tttatggatg agaacaagcc acacttcact 600
ctgaacgctg tactgccaaa ctacactatt ggcactatct tcgatccgga aactcagtcc 660
ggttccacct ccggttggat gatgtccctg tttaacggcg aggtttcccc ggctctggct 720
ctgatgccac cgcagtacta cgtttccgct gttgatattg gcctgctgca cctgggttgc 780
ctggttctgc cacaaatcga acgtcgtcgt gtttacggta ctgctggtac tttcgattgg 840
aacaccgttc tggctacctt ccgtaaactg tacccgtcca aaaccttccc ggctgacttc 900
ccagatcaag gtcaggacct gtctaaattc gacaccgctc cgtccctgga aattctgaaa 960
tctctgggtc gcccaggttg gcgttccatc gaagaatcca tcaaagacct ggttggttcc 1020
gaaaccgctt aa 1032



<210> 37 
<211> 343 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic protein derived from Sporidiobolus salmonicolor NADPH-D 
ependent Aldehyde Reductase 2, having an ala to gly mutation at a 
mino acid position 2 

<400> 37 

Met Ala Lys lie Asp Asn Ala Val Leu Pro Glu Gly Ser Leu Val Leu 
15 10 15 



Val Thr Gly Ala Asn Gly Phe Val Gly Ser His Val Val Glu Gin Leu 
20 25 30 



Leu Glu His Gly Tyr Lys Val Arg Gly Thr Ala Arg Ser Ala Ser Lys 
35 40 45 



Leu Ala Asn Leu Gin Lys Arg Trp Asp Ala Lys Tyr Pro Gly Arg Phe 
50 55 60 



Glu Thr Ala Val Val Glu Asp Met Leu Lys Gin Gly Ala Tyr Asp Glu 
65 70 75 80 



Val He Lys Gly Ala Ala Gly Val Ala His He Ala Ser Val Val Ser 
85 90 95 



Phe Ser Asn Lys Tyr Asp Glu Val Val Thr Pro Ala Ile Gly Gly Thr 
100 105 110 



Leu Asn Ala Leu Arg Ala Ala Ala Ala Thr Pro Ser Val Lys Arg Phe 
115 120 125 



Val Leu Thr Ser Ser Thr Val Ser Ala Leu Ile Pro Lys Pro Asn Val 
130 135 140 



Glu Gly Ile Tyr Leu Asp Glu Lys Ser Trp Asn Leu Glu Ser Ile Asp 
145 150 155 160 



Lys Ala Lys Thr Leu Pro Glu Ser Asp Pro Gln Lys Ser Leu Trp Val 
165 170 175 



Tyr Ala Ala Ser Lys Thr Glu Ala Glu Leu Ala Ala Trp Lys Phe Met 
180 185 190 



Asp Glu Asn Lys Pro His Phe Thr Leu Asn Ala Val Leu Pro Asn Tyr 
195 200 205 



Thr Ile Gly Thr Ile Phe Asp Pro Glu Thr Gln Ser Gly Ser Thr Ser 
210 215 220 



Gly Trp Met Met Ser Leu Phe Asn Gly Glu Val Ser Pro Ala Leu Ala 
225 230 235 240 



Leu Met Pro Pro Gln Tyr Tyr Val Ser Ala Val Asp Ile Gly Leu Leu 
245 250 255 



His Leu Gly Cys Leu Val Leu Pro Gln Ile Glu Arg Arg Arg Val Tyr 
260 265 270 



Gly Thr Ala Gly Thr Phe Asp Trp Asn Thr Val Leu Ala Thr Phe Arg 
275 280 285 



Lys Leu Tyr Pro Ser Lys Thr Phe Pro Ala Asp Phe Pro Asp Gln Gly 
290 295 300 



Gln Asp Leu Ser Lys Phe Asp Thr Ala Pro Ser Leu Glu Ile Leu Lys 
305 310 315 320 



Ser Leu Gly Arg Pro Gly Trp Arg Ser Ile Glu Glu Ser Ile Lys Asp 
325 330 335 



Leu Val Gly Ser Glu Thr Ala 
340 



<210> 38 

<211> 942 

<212> DNA 

<213> Saccharomyces cerevisiae 



<400> 38 

atgggtcctg ctacgttaaa gaattcttct gctacattaa aactaaatac tggtgcctcc 



60 











attccaatcrt 


taaatttcaa 


cacttggcgt 


tccgttgaca 


ataacggtta 


ccattctgta 


120 


attacaactt 


tcraaaactaa 


atacagacac 


attgatgctg 


cggctatcta 


tttgaatgaa 


180 


gaagaagttg 


gcagggctat 


taaagattcc 


ggagtccctc 


gtgaggaaat 


ttttattact 


240 


actaagcttt 




acaacgtgat 


ccQcraacrctci 


ctctaaacaa 


gtctttgaaa 


300 


agactaggct 


tcraafctatat 


tgacctatat 


ctgatgcatt 


qqccaqtcrcc 


tttgaaaacc 


360 


gacagagtta 


ctcrataotaa 


cgttctgtgc 


attccaacat 


tagaagatgg 


cactgttgac 


420 


atcaatacta 


aaaaatoaaa 


ttttatcaag 




tgatgcaaga 


gttgccaaag 


480 


acaaacaaaa 



ctaaaoccat 


tggtgtctct 


aatttttcta 


ttaacaacat 


taaagaatta 


540 


ttaaaatctc 


caaataacaa 


ggtggtacca 


gctactaatc 


aaattgaaat 


tcatccattg 


600 


ctaccacaag 


acgaattgat 


tgccttttgt 


aaggaaaagg 


gtattgttgt 


tgaagcctac 


660 


tcaccafct tcr 


crcracrtoctaa 


tgctccttta 


ctaaaagagc 


aagcaattat 


tgatatggct 


720 


aaaaaacaco 


acat tcraacc 


agcacagctt 


attatcagtt 


ggagtattca 


aagaggctac 


780 


at tattctacr 


ccaaatcaat 


taatcctgaa 


agaattgtat 


ccaattttaa 


gattttcact 


840 


ctocctaaaa 


atgatttcaa 


gactattagt 


aacctatcca 


aagtgcatgg 


tacaaagaga 


900 


gtcgttgata 


tgaagtgggg 


atccttccca 


attttccaat 


ga 




942 



<210> 39 
<211> 313 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 39 

Met Gly Pro Ala Thr Leu Lys Asn Ser Ser Ala Thr Leu Lys Leu Asn 
1 5 10 15 

Thr Gly Ala Ser Ile Pro Val Leu Gly Phe Gly Thr Trp Arg Ser Val 
20 25 30 

Asp Asn Asn Gly Tyr His Ser Val Ile Ala Ala Leu Lys Ala Gly Tyr 
35 40 45 

Arg His Ile Asp Ala Ala Ala Ile Tyr Leu Asn Glu Glu Glu Val Gly 
50 55 60 



Arg Ala Ile Lys Asp Ser Gly Val Pro Arg Glu Glu Ile Phe Ile Thr 
65 70 75 80 



Thr Lys Leu Trp Gly Thr Glu Gln Arg Asp Pro Glu Ala Ala Leu Asn 
85 90 95 



Lys Ser Leu Lys Arg Leu Gly Leu Asp Tyr Val Asp Leu Tyr Leu Met 
100 105 110 



His Trp Pro Val Pro Leu Lys Thr Asp Arg Val Thr Asp Gly Asn Val 
115 120 125 



Leu Cys Ile Pro Thr Leu Glu Asp Gly Thr Val Asp Ile Asp Thr Lys 
130 135 140 



Glu Trp Asn Phe Ile Lys Thr Trp Glu Leu Met Gln Glu Leu Pro Lys 
145 150 155 160 



Thr Gly Lys Thr Lys Ala Val Gly Val Ser Asn Phe Ser Ile Asn Asn 
165 170 175 



Ile Lys Glu Leu Leu Glu Ser Pro Asn Asn Lys Val Val Pro Ala Thr 
180 185 190 



Asn Gln Ile Glu Ile His Pro Leu Leu Pro Gln Asp Glu Leu Ile Ala 
195 200 205 



Phe Cys Lys Glu Lys Gly Ile Val Val Glu Ala Tyr Ser Pro Phe Gly 
210 215 220 



Ser Ala Asn Ala Pro Leu Leu Lys Glu Gln Ala Ile Ile Asp Met Ala 
225 230 235 240 



Lys Lys His Gly Val Glu Pro Ala Gln Leu Ile Ile Ser Trp Ser Ile 
245 250 255 



Gln Arg Gly Tyr Val Val Leu Ala Lys Ser Val Asn Pro Glu Arg Ile 
260 265 270 



Val Ser Asn Phe Lys Ile Phe Thr Leu Pro Glu Asp Asp Phe Lys Thr 
275 280 285 



Ile Ser Asn Leu Ser Lys Val His Gly Thr Lys Arg Val Val Asp Met 
290 295 300 



Lys Trp Gly Ser Phe Pro Ile Phe Gln 
305 310 



<210> 40 
<211> 942 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic gene derived from Saccharomyces cerevisiae YPR1 putative
reductase, having glycine codon inserted after the initiating
methionine codon



<400> 40 
atgggtccgg 


caactctgaa 


gaactcttct 


gcaactctga 


aactgaacac 


tggtgcatct 


60 


atcccggttc 


tgggtttcgg 


tacttggcgt 


tctgttgaca 


acaacggtta 


ccactccgtt 


120 


atcgcagcac 


tgaaagcagg 


ttaccgtcac 


atcgacgcag 


cagcaatcta 


cctgaacgaa 


180 


gaagaagtag 


gtcgtgcaat 


caaagactcc 


ggtgttccgc 


gtgaagaaat 


ctttatcact 


240 


actaaactgt 


ggggtactga 


acagcgtgac 


ccggaagcag 


cactgaacaa 


atctctgaaa 


300 


cgcctgggcc 




/— r -j +- (^r fap 
cty dLL Ly U ct l_ 






v CL CLClClv 1— 


360 


gaccgtgtta 


ctgatggtaa 


cgttctgtgt 


attccgactc 


tggaagacgg 


tactgtagac 


420 


atcgacacta 


aggaatggaa 


cttcatcaag 


acttgggaac 


tgatgcagga 


actgccgaaa 


480 


actggtaaaa 


ctaaagcagt 


aggtgtttcc 


aacttctcta 


tcaacaacat 


caaagaactg 


540 


ctggaatctc 


cgaacaacaa 


agtagtaccg 


y Vw- d d ^— l ct a. i_> v_> 


0^-4 d l» ^ CI CI. CI. 1— 


V G*. W V ^ 


600 


ctgccgcagg 


acgaactgat 


cgcattctgc 


aaagagaaag 


gtatcgtagt 


agaagcatac 


660 


tctccgttcg 


gctctgcaaa 


cgcaccgctg 


ctgaaagaac 


aggcaatcat 


cgacatggca 


720 


aagaaacacg 


gtgtagaacc 


ggcacagctg 


atcatctctt 


ggtctatcca 


gcgtggttac 


780 


gtagtactgg 


caaaatctgt 


aaacccggaa 


cgtatcgtat 


ctaacttcaa 


aatcttcact 


840 


ctgccggaag 


acgacttcaa 


aactatctct 


aacctgtcca 


aagttcacgg 


tactaaacgt 


900 


gtagtagaca 


tgaaatgggg 


ttctttcccg 


atcttccagt 


aa 




942 


<210> 41 
<211> 939 
<212> DNA 

<213> Saccharomyces cerevisiae 










<400> 41 
atgcctgcta 


ctttacatga 


ttctacgaaa 


atcctttctc 


taaatactgg 


agcccaaatc 


60 











CC tC cL cLd.td.Cf 


gtttaggtac 


ataacaatCQ 


aaagagaacg 


atgcttataa 


ggctgtttta 


120 


arcactttaa 


aaaa tcrcrc t a 


ccgacacatt 


gatactgctg 


ctatttaccg 


taatgaagac 


180 


caaatcaatc 


aagccatcaa 


aaattcaaat 


attCCtCQQQ 


aagaaatctt 


tgttactaca 


240 


aaattataat 


gtacacaaca 


ccacgaacct 


aaaataacQC 


tggatcaatc 


actaaagagg 


300 


ttaaaattaa 


actacataaa 


cttatatttg 


atacattaac 


ctgccagatt 


agatccagcc 


360 


tacatcaaaa 


atgaagacat 


cttgagtgtg 


ccaacaaaga 


aqqataattc 


tcqtqcaqtq 


420 


aatatcacca 


attaaaattt 


catcaaaacc 


tgggaattaa 


tgcaggaact 


accaaagact 


480 


aataaaacfca 


aaac cat t era 


agtctccaac 


ttttctataa 


ataacctgaa 


agatctatta 


540 


acatctcaaa 


gtaataagct 


tacgccagct 


gctaaccaag 


tcgaaataca 


tccattacta 


600 


cctcaaaaca 


aattgattaa 


tttttgtaaa 


agtaaaggca 


ttqtqqttqa 


agcttattct 


660 


ccattaaata 


gtaccgatgc 


tccactattg 


aaggaaccgg 


ttatccttga 


aattgcgaag 


720 


aaaaataacc 


ttcaacccgg 


acacgttgtt 


attagctggc 


acgtccaaag 


aggttatgtt 


780 


gtcttgccaa 


aatctgtgaa 


tcccgatcga 


atcaaaacga 


acaggaaaat 


atttactttg 


840 


tctactgagg 


actttgaagc 


tatcaataac 


atatcgaagg 


aaaagggcga 


aaaaagggtt 


900 


gtacatccaa 


attggtctcc 


tttcgaagta 


ttcaagtaa 






939 



<210> 42 
<211> 312 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 42 

Met Pro Ala Thr Leu His Asp Ser Thr Lys Ile Leu Ser Leu Asn Thr 
1 5 10 15 

Gly Ala Gln Ile Pro Gln Ile Gly Leu Gly Thr Trp Gln Ser Lys Glu 
20 25 30 

Asn Asp Ala Tyr Lys Ala Val Leu Thr Ala Leu Lys Asp Gly Tyr Arg 
35 40 45 

His Ile Asp Thr Ala Ala Ile Tyr Arg Asn Glu Asp Gln Val Gly Gln 
50 55 60 



Ala Ile Lys Asp Ser Gly Val Pro Arg Glu Glu Ile Phe Val Thr Thr 
65 70 75 80 



Lys Leu Trp Cys Thr Gln His His Glu Pro Glu Val Ala Leu Asp Gln 
85 90 95 



Ser Leu Lys Arg Leu Gly Leu Asp Tyr Val Asp Leu Tyr Leu Met His 
100 105 110 



Trp Pro Ala Arg Leu Asp Pro Ala Tyr Ile Lys Asn Glu Asp Ile Leu 
115 120 125 



Ser Val Pro Thr Lys Lys Asp Gly Ser Arg Ala Val Asp Ile Thr Asn 
130 135 140 



Trp Asn Phe Ile Lys Thr Trp Glu Leu Met Gln Glu Leu Pro Lys Thr 
145 150 155 160 



Gly Lys Thr Lys Ala Val Gly Val Ser Asn Phe Ser Ile Asn Asn Leu 
165 170 175 



Lys Asp Leu Leu Ala Ser Gln Gly Asn Lys Leu Thr Pro Ala Ala Asn 
180 185 190 



Gln Val Glu Ile His Pro Leu Leu Pro Gln Asp Glu Leu Ile Asn Phe 
195 200 205 



Cys Lys Ser Lys Gly Ile Val Val Glu Ala Tyr Ser Pro Leu Gly Ser 
210 215 220 



Thr Asp Ala Pro Leu Leu Lys Glu Pro Val Ile Leu Glu Ile Ala Lys 
225 230 235 240 



Lys Asn Asn Val Gln Pro Gly His Val Val Ile Ser Trp His Val Gln 
245 250 255 



Arg Gly Tyr Val Val Leu Pro Lys Ser Val Asn Pro Asp Arg Ile Lys 
260 265 270 



Thr Asn Arg Lys Ile Phe Thr Leu Ser Thr Glu Asp Phe Glu Ala Ile 
275 280 285 



Asn Asn Ile Ser Lys Glu Lys Gly Glu Lys Arg Val Val His Pro Asn 
290 295 300 



Trp Ser Pro Phe Glu Val Phe Lys 
305 310 



<210> 43 
<211> 942 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic gene derived from Saccharomyces cerevisiae GCY1 reductase,
having numerous codons replaced with others encoding the same
amino acids to reduce the free energy of folding, and a ggc
insertion after the initiating atg



<400> 43 
atgggcccag 


ctactctgca 


cgactctacc 


aaaattctgt 


ctctgaacac 


cggtgctcaa 


60 


atcccgcaaa 


tcggcctggg 


tacttggcaa 


tctaaagaaa 


acgacgcata 


caaggctgtt 


120 


ctgactgctc 


tgaaggatgg 


ctatcgtcac 


attgatactg 


ctgctattta 


tcgtaacgag 


180 


gaccaggtag 


gtcaggcaat 


caaggactct 


ggcgttccgc 


gtgaggaaat 


cttcgtaact 


240 


accaaactgt 


ggtgcactca 


gcatcatgaa 


ccggaagtag 


cactggatca 


atctctgaag 


300 


cgtctgggtc 


tggactatgt 


tgatctgtac 


ctgatgcatt 


ggccggcgcg 


cctggaccca 


360 


gcgtatatta 


aaaacgaaga 


tatcctgtct 


gttccgacta 


agaaagacgg 


ctctcgtgct 


420 


gttgacatca 


ctaactggaa 


cttcatcaag 


acctgggaac 


tgatgcagga 


actgccgaag 


480 


actggtaaaa 


ctaaagctgt 


tggcgtatct 


aacttctcca 


tcaacaacct 


gaaggacctg 


540 


ctggcatccc 


agggcaacaa 


gctgactccg 


gctgctaacc 


aagtagagat 


ccacccgctg 


600 


ctgccgcagg 


acgaactgat 


caacttctgt 


aaatctaaag 


gcattgtagt 


tgaagcatat 


660 


tctccgctgg 


gttctaccga 


tgcgccactg 


ctgaaagagc 


cggtaatcct 


ggagatcgcg 


720 


aagaaaaaca 


acgtacaacc 


aggtcatgta 


gtaatctctt 


ggcacgtaca 


gcgcggctac 


780 


gtagttctgc 


cgaagtctgt 


aaacccggat 


cgtatcaaaa 


ctaaccgtaa 


aatctttacc 


840 


ctgtccaccg 


aagatttcga 


agcaatcaac 


aacatctcca 


aggaaaaggg 


cgagaaacgt 


900 


gtagttcacc 


caaactggtc 


cccgtttgaa 


gtattcaagt 


aa 




942 



<210> 44 

<211> 313 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic protein derived from Saccharomyces cerevisiae GCY1 reductase,
having a glycine inserted at position 2 in the amino acid
sequence
<400> 44 

Met Gly Pro Ala Thr Leu His Asp Ser Thr Lys Ile Leu Ser Leu Asn 
1 5 10 15 



Thr Gly Ala Gln Ile Pro Gln Ile Gly Leu Gly Thr Trp Gln Ser Lys 
20 25 30 



Glu Asn Asp Ala Tyr Lys Ala Val Leu Thr Ala Leu Lys Asp Gly Tyr 
35 40 45 



Arg His Ile Asp Thr Ala Ala Ile Tyr Arg Asn Glu Asp Gln Val Gly 
50 55 60 



Gln Ala Ile Lys Asp Ser Gly Val Pro Arg Glu Glu Ile Phe Val Thr 
65 70 75 80 



Thr Lys Leu Trp Cys Thr Gln His His Glu Pro Glu Val Ala Leu Asp 
85 90 95 



Gln Ser Leu Lys Arg Leu Gly Leu Asp Tyr Val Asp Leu Tyr Leu Met 
100 105 110 



His Trp Pro Ala Arg Leu Asp Pro Ala Tyr Ile Lys Asn Glu Asp Ile 
115 120 125 



Leu Ser Val Pro Thr Lys Lys Asp Gly Ser Arg Ala Val Asp Ile Thr 
130 135 140 



Asn Trp Asn Phe Ile Lys Thr Trp Glu Leu Met Gln Glu Leu Pro Lys 
145 150 155 160 



Thr Gly Lys Thr Lys Ala Val Gly Val Ser Asn Phe Ser Ile Asn Asn 
165 170 175 



Leu Lys Asp Leu Leu Ala Ser Gln Gly Asn Lys Leu Thr Pro Ala Ala 
180 185 190 



Asn Gln Val Glu Ile His Pro Leu Leu Pro Gln Asp Glu Leu Ile Asn 
195 200 205 



Phe Cys Lys Ser Lys Gly Ile Val Val Glu Ala Tyr Ser Pro Leu Gly 




210 215 220 

Ser Thr Asp Ala Pro Leu Leu Lys Glu Pro Val Ile Leu Glu Ile Ala 
225 230 235 240 

Lys Lys Asn Asn Val Gln Pro Gly His Val Val Ile Ser Trp His Val 
245 250 255 

Gln Arg Gly Tyr Val Val Leu Pro Lys Ser Val Asn Pro Asp Arg Ile 
260 265 270 

Lys Thr Asn Arg Lys Ile Phe Thr Leu Ser Thr Glu Asp Phe Glu Ala 
275 280 285 

Ile Asn Asn Ile Ser Lys Glu Lys Gly Glu Lys Arg Val Val His Pro 
290 295 300 

Asn Trp Ser Pro Phe Glu Val Phe Lys 
305 310 

<210> 45 

<211> 1029 

<212> DNA 

<213> Saccharomyces cerevisiae 

<400> 45 



atgtcagttt 


tcgtttcagg 


tgctaacggg 


ttcattgccc 


aacacattgt 


cgatctcctg 


60 


ttgaaggaag 


actataaggt 


catcggttct 


gccagaagtc 


aagaaaaggc 


cgagaattta 


120 


acggaggcct 


ttggtaacaa 


cccaaaattc 


tccatggaag 


ttgtcccaga 


catatctaag 


180 


ctggacgcat 


ttgaccatgt 


tttccaaaag 


cacggcaagg 


atatcaagat 


agttctacat 


240 


acggcctctc 


cattctgctt 


tgatatcact 


gacagtgaac 


gcgatttatt 


aattcctgct 


300 


gtgaacggtg 


ttaagggaat 


tctccactca 


attaaaaaat 


acgccgctga 


ttctgtagaa 


360 


cgtgtagttc 


tcacctcttc 


ttatgcagct 


gtgttcgata 


tggcaaaaga 


aaacgataag 


420 


tctttaacat 


ttaacgaaga 


atcctggaac 


ccagctacct 


gggagagttg 


ccaaagtgac 


480 


ccagttaacg 


cctactgtgg 


ttctaagaag 


tttgctgaaa 


aagcagcttg 


ggaatttcta 


540 


gaggagaata 


gagactctgt 


aaaattcgaa 


ttaactgccg 


ttaacccagt 


ttacgttttt 


600 


ggtccgcaaa 


tgtttgacaa 


agatgtgaaa 


aaacacttga 


acacatcttg 


cgaactcgtc 


660 


aacagcttga 


tgcatttatc 


accagaggac 


aagataccgg 


aactatttgg 


tggatacatt 


720 











era t"a t* t" cat' a 


ai~Cfi~t~Cff' , aaa 
Quy i» t^y i« acici 


aa r*t~ r*a t" t~ t - a 


attaccttcc 

y i> ^-y w 


aaaaaaaaaa 


aacaattaot 


780 


Wuu CI ^> t— CI CI 


i_ v_ y v_ c*. Lv^yy q 


aarraoattt 

y y ^ucivj ci i_ i_ 


ci> \~ « *»y ^— ' ~*y j 


atattctcaa 


tatccttaac 


840 


y ci ciy cx v_ i_ i_ v— 


v- i— y v_ V— v_ l> a. 0 a. 


a crap aat*att 

ay y act lclll 


ccaataaaaa 
^uexy i-yyyy a 


aaccacrcrfc t.c 

^4 \*> ^4 V*- 


taatoctacc 


900 


mfcaacaccc 




1" c t - 1* a a t a a t 


aaaaaaaata 


acxaaat tatt 


aaatt tcaaa 


960 


f- /-^ o nn 

i_ i_\_ciyyci.ci\-. l. 


Uy CLCLCiy ay CLl* 




actocctccc 


aaattttaaa 


atttaaaaac 


1020 


agaatataa 
<210> 46 

<211> 342 

<212> PRT 

<213> Saccharomyces cerevisiae 

<400> 46 

Met Ser Val Phe Val Ser Gly Ala Asn Gly Phe Ile Ala Gln His Ile 
1 5 10 15 



Val Asp Leu Leu Leu Lys Glu Asp Tyr Lys Val Ile Gly Ser Ala Arg 
20 25 30 



Ser Gln Glu Lys Ala Glu Asn Leu Thr Glu Ala Phe Gly Asn Asn Pro 
35 40 45 



Lys Phe Ser Met Glu Val Val Pro Asp Ile Ser Lys Leu Asp Ala Phe 
50 55 60 



Asp His Val Phe Gln Lys His Gly Lys Asp Ile Lys Ile Val Leu His 
65 70 75 80 



Thr Ala Ser Pro Phe Cys Phe Asp Ile Thr Asp Ser Glu Arg Asp Leu 
85 90 95 



Leu Ile Pro Ala Val Asn Gly Val Lys Gly Ile Leu His Ser Ile Lys 
100 105 110 



Lys Tyr Ala Ala Asp Ser Val Glu Arg Val Val Leu Thr Ser Ser Tyr 
115 120 125 



Ala Ala Val Phe Asp Met Ala Lys Glu Asn Asp Lys Ser Leu Thr Phe 
130 135 140 



Asn Glu Glu Ser Trp Asn Pro Ala Thr Trp Glu Ser Cys Gln Ser Asp 



145 150 155 160 



Pro Val Asn Ala Tyr Cys Gly Ser Lys Lys Phe Ala Glu Lys Ala Ala 
165 170 175 



Trp Glu Phe Leu Glu Glu Asn Arg Asp Ser Val Lys Phe Glu Leu Thr 
180 185 190 



Ala Val Asn Pro Val Tyr Val Phe Gly Pro Gln Met Phe Asp Lys Asp 
195 200 205 



Val Lys Lys His Leu Asn Thr Ser Cys Glu Leu Val Asn Ser Leu Met 
210 215 220 



His Leu Ser Pro Glu Asp Lys Ile Pro Glu Leu Phe Gly Gly Tyr Ile 
225 230 235 240 



Asp Val Arg Asp Val Ala Lys Ala His Leu Val Ala Phe Gln Lys Arg 
245 250 255 



Glu Thr Ile Gly Gln Arg Leu Ile Val Ser Glu Ala Arg Phe Thr Met 
260 265 270 



Gln Asp Val Leu Asp Ile Leu Asn Glu Asp Phe Pro Val Leu Lys Gly 
275 280 285 



Asn Ile Pro Val Gly Lys Pro Gly Ser Gly Ala Thr His Asn Thr Leu 
290 295 300 



Gly Ala Thr Leu Asp Asn Lys Lys Ser Lys Lys Leu Leu Gly Phe Lys 
305 310 315 320 



Phe Arg Asn Leu Lys Glu Thr Ile Asp Asp Thr Ala Ser Gln Ile Leu 
325 330 335 



Lys Phe Glu Gly Arg Ile 
340 



<210> 47 

<211> 1032 

<212> DNA 

<213> Artificial Sequence 



<220> 



<223> Synthetic gene derived from Saccharomyces cerevisiae GRE2 reductase,
having numerous codons replaced with others encoding the same
amino acids to reduce the free energy of folding, and a ggc
insertion at position 2 in the amino acid sequence



<400> 47 



atgggctctg 


tatttgtatc 


tggcgctaac 


ggttttatcg 


ctcaacacat 


cgtcgatctg 


60 


ctgctgaaag 


aagattacaa 


agttatcggt 


tccgcacgtt 


cccaggaaaa 


agctgaaaac 


120 


ctgactgaag 


catttggtaa 


caacccgaag 


ttctctatgg 


aagtagtacc 


ggacatttct 


180 


aaactggacg 


cattcgacca 


cgtattccaa 


aagcacggta 


aggatatcaa 


gatcgtactg 


240 


cacactgcat 


ctccattctg 


ttttgacatc 


actgattctg 


agcgcgacct 


gctgattccg 


300 


gctgttaacg 


gtgttaaagg 


tattctgcac 


tctattaaga 


aatatgctgc 


tgattccgta 


360 


gaacgcgtag 


ttctgacttc 


ctcttatgct 


gcagtattcg 


atatggctaa 


agagaacgac 


420 


aaatccctga 


cttttaacga 


agaatcttgg 


aacccggcta 


cctgggaatc 


ttgccagtct 


480 


gacccggtta 


acgcttattg 


tggctctaag 


aagtttgctg 


aaaaagctgc 


ttgggaattc 


540 


ctggaagaaa 


accgtgactc 


tgtaaagttc 


gagctgaccg 


ctgtaaaccc 


ggtatacgtt 


600 


tttggcccgc 


agatgttcga 


taaagatgta 


aagaagcacc 


tgaacacttc 


ctgtgaactg 


660 


gtaaactctc 


tgatgcacct 


gtctccagaa 


gataaaatcc 


cggagctgtt 



cggcggttac 


720 


atcgacgttc 


gtgacgtagc 


aaaagcacat 


ctggtagctt 


tccagaagcg 


tgagactatc 


780 


ggccagcgtc 


tgattgtttc 


cgaggctcgt 


ttcaccatgc 


aggatgttct 


ggatattctg 


840 


aacgaagact 


tcccggtact 


gaaaggtaac 


attccggtgg 


gtaaaccagg 


ctctggtgca 


900 


actcataaca 


ctctgggtgc 


aactctggat 


aacaagaagt 


ctaagaaact 


gctgggtttt 


960 


aaattccgta 


acctgaaaga 


aactattgac 


gacactgcat 


ctcagatcct 


gaaattcgaa 


1020 


ggtcgcatct 


aa 










1032 



<210> 48 
<211> 343 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic protein derived from Saccharomyces cerevisiae Gre2 reductase,
having a glycine inserted at position 2 in the amino acid
sequence

<400> 48 



Met Gly Ser Val Phe Val Ser Gly Ala Asn Gly Phe Ile Ala Gln His 
1 5 10 15 



Ile Val Asp Leu Leu Leu Lys Glu Asp Tyr Lys Val Ile Gly Ser Ala 
20 25 30 



Arg Ser Gln Glu Lys Ala Glu Asn Leu Thr Glu Ala Phe Gly Asn Asn 
35 40 45 



Pro Lys Phe Ser Met Glu Val Val Pro Asp Ile Ser Lys Leu Asp Ala 
50 55 60 



Phe Asp His Val Phe Gln Lys His Gly Lys Asp Ile Lys Ile Val Leu 
65 70 75 80 



His Thr Ala Ser Pro Phe Cys Phe Asp Ile Thr Asp Ser Glu Arg Asp 
85 90 95 



Leu Leu Ile Pro Ala Val Asn Gly Val Lys Gly Ile Leu His Ser Ile 
100 105 110 



Lys Lys Tyr Ala Ala Asp Ser Val Glu Arg Val Val Leu Thr Ser Ser 
115 120 125 



Tyr Ala Ala Val Phe Asp Met Ala Lys Glu Asn Asp Lys Ser Leu Thr 
130 135 140 



Phe Asn Glu Glu Ser Trp Asn Pro Ala Thr Trp Glu Ser Cys Gln Ser 
145 150 155 160 



Asp Pro Val Asn Ala Tyr Cys Gly Ser Lys Lys Phe Ala Glu Lys Ala 
165 170 175 



Ala Trp Glu Phe Leu Glu Glu Asn Arg Asp Ser Val Lys Phe Glu Leu 
180 185 190 



Thr Ala Val Asn Pro Val Tyr Val Phe Gly Pro Gln Met Phe Asp Lys 
195 200 205 



Asp Val Lys Lys His Leu Asn Thr Ser Cys Glu Leu Val Asn Ser Leu 
210 215 220 



Met His Leu Ser Pro Glu Asp Lys Ile Pro Glu Leu Phe Gly Gly Tyr 
225 230 235 240 



Ile Asp Val Arg Asp Val Ala Lys Ala His Leu Val Ala Phe Gln Lys 
245 250 255 



Arg Glu Thr Ile Gly Gln Arg Leu Ile Val Ser Glu Ala Arg Phe Thr 
260 265 270 



Met Gln Asp Val Leu Asp Ile Leu Asn Glu Asp Phe Pro Val Leu Lys 
275 280 285 



Gly Asn Ile Pro Val Gly Lys Pro Gly Ser Gly Ala Thr His Asn Thr 
290 295 300 



Leu Gly Ala Thr Leu Asp Asn Lys Lys Ser Lys Lys Leu Leu Gly Phe 
305 310 315 320 



Lys Phe Arg Asn Leu Lys Glu Thr Ile Asp Asp Thr Ala Ser Gln Ile 
325 330 335 



Leu Lys Phe Glu Gly Arg Ile 
340 



<210> 49 
<211> 987 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Essentially encodes the native GRE3 reductase gene from Saccharomyces
cerevisiae, but has a codon for isoleucine inserted after the
initiating methionine codon to incorporate a restriction site



<400> 49 
atgatttctt 


cactggttac 


tcttaataac 


ggtctgaaaa 


tgcccctagt 


cggcttaggg 


60 


tgctggaaaa 


ttgacaaaaa 


agtctgtgcg 


aatcaaattt 


atgaagctat 


caaattaggc 


120 


taccgtttat 


tcgatggtgc 


ttgcgactac 


ggcaacgaaa 


aggaagttgg 


tgaaggtatc 


180 


aggaaagcca 


tctccgaagg 


tcttgtttct 


agaaaggata 


tatttgttgt 


ttcaaagtta 


240 


tggaacaatt 


ttcaccatcc 


tgatcatgta 


aaattagctt 


taaagaagac 


cttaagcgat 


300 


atgggacttg 


attatttaga 


cctgtattat 


attcacttcc 


caatcgcctt 


caaatatgtt 


360 


ccatttgaag 


agaaataccc 


tccaggattc 


tatacgggcg 


cagatgacga 


gaagaaaggt 


420 


cacatcaccg 


aagcacatgt 


accaatcata 


gatacgtacc 


gggctctgga 


agaatgtgtt 


480 


gatgaaggct 


tgattaagtc 


tattggtgtt 


tccaactttc 


agggaagctt 


gattcaagat 


540 



ttattacgtg gttgtagaat caagcccgtg gctttgcaaa ttgaacacca tccttatttg 600 

actcaagaac acctagttga gttttgtaaa ttacacgata tccaagtagt tgcttactcc 660 

tccttcggtc ctcaatcatt cattgagatg gacttacagt tggcaaaaac cacgccaact 720 

ctgttcgaga atgatgtaat caagaaggtc tcacaaaacc atccaggcag taccacttcc 780 

caagtattgc ttagatgggc aactcagaga ggcattgccg tcattccaaa atcttccaag 840 

aaggaaaggt tacttggcaa cctagaaatc gaaaaaaagt tcactttaac ggagcaagaa 900 

ttgaaggata tttctgcact aaatgccaac atcagattta atgatccatg gacctggttg 960 

gatggtaaat tccccacttt tgcctga 987 



<210> 50 
<211> 328 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Essentially encodes the native GRE3 reductase protein from Saccharomyces
cerevisiae, but has an isoleucine inserted after the initiating
methionine

<400> 50 

Met Ile Ser Ser Leu Val Thr Leu Asn Asn Gly Leu Lys Met Pro Leu 
1 5 10 15 



Val Gly Leu Gly Cys Trp Lys Ile Asp Lys Lys Val Cys Ala Asn Gln 
20 25 30 



Ile Tyr Glu Ala Ile Lys Leu Gly Tyr Arg Leu Phe Asp Gly Ala Cys 
35 40 45 



Asp Tyr Gly Asn Glu Lys Glu Val Gly Glu Gly Ile Arg Lys Ala Ile 
50 55 60 



Ser Glu Gly Leu Val Ser Arg Lys Asp Ile Phe Val Val Ser Lys Leu 
65 70 75 80 



Trp Asn Asn Phe His His Pro Asp His Val Lys Leu Ala Leu Lys Lys 
85 90 95 



Thr Leu Ser Asp Met Gly Leu Asp Tyr Leu Asp Leu Tyr Tyr Ile His 
100 105 110 



Phe Pro Ile Ala Phe Lys Tyr Val Pro Phe Glu Glu Lys Tyr Pro Pro 
115 120 125 



Gly Phe Tyr Thr Gly Ala Asp Asp Glu Lys Lys Gly His Ile Thr Glu 
130 135 140 



Ala His Val Pro Ile Ile Asp Thr Tyr Arg Ala Leu Glu Glu Cys Val 
145 150 155 160 



Asp Glu Gly Leu Ile Lys Ser Ile Gly Val Ser Asn Phe Gln Gly Ser 
165 170 175 



Leu Ile Gln Asp Leu Leu Arg Gly Cys Arg Ile Lys Pro Val Ala Leu 
180 185 190 



Gln Ile Glu His His Pro Tyr Leu Thr Gln Glu His Leu Val Glu Phe 
195 200 205 



Cys Lys Leu His Asp Ile Gln Val Val Ala Tyr Ser Ser Phe Gly Pro 
210 215 220 



Gln Ser Phe Ile Glu Met Asp Leu Gln Leu Ala Lys Thr Thr Pro Thr 
225 230 235 240 



Leu Phe Glu Asn Asp Val Ile Lys Lys Val Ser Gln Asn His Pro Gly 
245 250 255 



Ser Thr Thr Ser Gln Val Leu Leu Arg Trp Ala Thr Gln Arg Gly Ile 
260 265 270 



Ala Val Ile Pro Lys Ser Ser Lys Lys Glu Arg Leu Leu Gly Asn Leu 
275 280 285 



Glu Ile Glu Lys Lys Phe Thr Leu Thr Glu Gln Glu Leu Lys Asp Ile 
290 295 300 



Ser Ala Leu Asn Ala Asn Ile Arg Phe Asn Asp Pro Trp Thr Trp Leu 
305 310 315 320 



Asp Gly Lys Phe Pro Thr Phe Ala 
325 



<210> 51 
<211> 987 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic gene derived from Saccharomyces cerevisiae GRE3 reductase,
having numerous codons replaced with others encoding the same
amino acids to reduce the free energy of folding, and a ggc codon
insertion after the initiating atg



<400> 51 
atgggctctt 


ctctggtaac 


tctgaacaac 


ggtctgaaaa 


tgccgctggt 


aggcctgggc 


60 


tgctggaaaa 


tcgataagaa 


agtatgtgct 


aaccaaattt 


atgaggctat 


caaactgggc 


120 


tatcgcctgt 


tcgacggtgc 


ttgcgactat 


ggtaacgaga 


aggaagttgg 


tgaaggcatc 


180 


cgtaaagcta 


tctctgaagg 


tctggtatct 


cgtaaggata 


tctttgtagt 


atctaagctg 


240 


tggaacaact 


ttcatcaccc 


ggatcacgta 


aaactggcac 


tgaagaaaac 


cctgtctgat 


300 


atgggtctgg 


attatctgga 


tctgtactat 


atccactttc 


cgatcgcatt 


taaatacgta 


360 


ccgttcgaag 


aaaaatatcc 


gccgggcttt 


tacactggtg 


cagacgacga 


aaagaagggt 


420 


cacatcactg 


aagctcacgt 


accgatcatc 


gacacttacc 


gtgctctgga 


ggaatgtgta 


480 


gacgaaggtc 


tgatcaaatc 


tatcggtgta 


tctaacttcc 


agggttctct 


gatccaggat 


540 


ctgctgcgtg 


gttgccgtat 


caagccggtt 


gctctgcaaa 


ttgaacacca 


cccgtacctg 


600 


acccaggaac 


acctggttga 


attctgcaaa 


ctgcacgata 


tccaagtagt 


agcatactct 


660 


tctttcggtc 


cgcagtcttt 


catcgaaatg 


gacctgcagc 


tggctaagac 


caccccgact 


720 


ctgttcgaaa 


acgacgtaat 


caagaaagta 


tctcagaacc 


acccgggctc 


tactacctct 


780 


caggtactgc 


tgcgttgggc 


tactcagcgt 


ggcatcgctg 


ttatcccgaa 


atcttctaag 


840 


aaagaacgtc 


tgctgggtaa 


cctggaaatc 


gaaaagaaat 


tcactctgac 


cgaacaggaa 


900 


ctgaaagata 


tctctgctct 


gaacgctaac 


atccgtttca 


acgatccgtg 


gacctggctg 


960 


gatggtaaat 


tcccgacttt 


cgcttaa 








987 



<210> 52 
<211> 328 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic protein derived from Saccharomyces cerevisiae Gre3 reductase,
having a glycine inserted at position 2 in the amino acid
sequence



<400> 52 

Met Gly Ser Ser Leu Val Thr Leu Asn Asn Gly Leu Lys Met Pro Leu 
1 5 10 15 



Val Gly Leu Gly Cys Trp Lys Ile Asp Lys Lys Val Cys Ala Asn Gln 
20 25 30 



Ile Tyr Glu Ala Ile Lys Leu Gly Tyr Arg Leu Phe Asp Gly Ala Cys 
35 40 45 



Asp Tyr Gly Asn Glu Lys Glu Val Gly Glu Gly Ile Arg Lys Ala Ile 
50 55 60 



Ser Glu Gly Leu Val Ser Arg Lys Asp Ile Phe Val Val Ser Lys Leu 
65 70 75 80 



Trp Asn Asn Phe His His Pro Asp His Val Lys Leu Ala Leu Lys Lys 
85 90 95 



Thr Leu Ser Asp Met Gly Leu Asp Tyr Leu Asp Leu Tyr Tyr Ile His 
100 105 110 



Phe Pro Ile Ala Phe Lys Tyr Val Pro Phe Glu Glu Lys Tyr Pro Pro 
115 120 125 



Gly Phe Tyr Thr Gly Ala Asp Asp Glu Lys Lys Gly His Ile Thr Glu 
130 135 140 



Ala His Val Pro Ile Ile Asp Thr Tyr Arg Ala Leu Glu Glu Cys Val 
145 150 155 160 



Asp Glu Gly Leu Ile Lys Ser Ile Gly Val Ser Asn Phe Gln Gly Ser 
165 170 175 



Leu Ile Gln Asp Leu Leu Arg Gly Cys Arg Ile Lys Pro Val Ala Leu 
180 185 190 



Gln Ile Glu His His Pro Tyr Leu Thr Gln Glu His Leu Val Glu Phe 
195 200 205 



Cys Lys Leu His Asp Ile Gln Val Val Ala Tyr Ser Ser Phe Gly Pro 
210 215 220 



Gln Ser Phe Ile Glu Met Asp Leu Gln Leu Ala Lys Thr Thr Pro Thr 
225 230 235 240 



Leu Phe Glu Asn Asp Val Ile Lys Lys Val Ser Gln Asn His Pro Gly 
245 250 255 



Ser Thr Thr Ser Gln Val Leu Leu Arg Trp Ala Thr Gln Arg Gly Ile 
260 265 270 



Ala Val Ile Pro Lys Ser Ser Lys Lys Glu Arg Leu Leu Gly Asn Leu 
275 280 285 



Glu Ile Glu Lys Lys Phe Thr Leu Thr Glu Gln Glu Leu Lys Asp Ile 
290 295 300 



Ser Ala Leu Asn Ala Asn Ile Arg Phe Asn Asp Pro Trp Thr Trp Leu 
305 310 315 320 



Asp Gly Lys Phe Pro Thr Phe Ala 
325 



<210> 53 

<211> 1461 

<212> DNA 

<213> Sus scrofa 



<400> 53 
atgaatgcca 


gcgatttccg 


tcgacgcggc 


aaagaaatgg 


tggattacat 


ggcggattac 


60 


ctggaaggca 


tcgaaggtcg 


tcaggtgtac 


ccggatgtgc 


agccggggta 


cctgcgtccg 


120 


ctgatcccgg 


cgaccgcccc 


gcaggaaccg 


gataccttcg 


aagatatcct 


gcaggatgtg 


180 


gaaaaaatca 


tcatgccggg 


ggtgacccac 


tggcacagcc 


cgtacttctt 


cgcgtacttc 


240 


ccgaccgcca 


gcagctaccc 


ggcgatgctg 


gcggatatgc 


tgtgcggtgc 


gatcggatgc 


300 


atcggtttca 


gctgggcggc 


tagcccggcg 


tgcaccgaac 


tcgagaccgt 


gatgatggat 


360 


tggctgggca 


aaatgctcca 


gcttccggaa 


gcgttcctgg 


cgggcgaagc 


cggtgaaggc 


420 


ggcggcgtga 


tccagggtag 


cgccagcgaa 


gccaccctgg 


tggcgctgct 


ggcggcgcgt 


480 


accaaagtgg 


tgcgacgtct 


gcaagcggcg 


agcccgggcc 


tgacccaggg 


cgcggtgctg 


540 


gaaaaactag 


tggcgtacgc 


gagtgatcag 


gcgcacagca 


gcgtggaacg 


tgccggcctg 


600 


atcggcggcg 


tgaaactgaa 


agcgatcccg 


agcgatggca 


aattcgcgat 


gcgtgcgagc 


660 









gcgctgcagg 


aggccctgga 


gagagacaag 


gctgccggcc 


tgattccttt 


cttcgtggtg 


gctacgctgg 


ggaccacatc 


gtgctgctcc 


tttgacaatc 


tcttagaagt 


gggacccatc 


tgtcacgaag 


aggacatatg 


gctgcacgtg 


gatgctgcct 


acgcaggcag 


tgccttcatc 


tgccctgagt 


tccggcacct 


gctgaatgga 


gtggagtttg 


cagattcatt 


taactttaat 


ccccacaaat 


ggctcttggt 


gaattttgac 


tgctcggcta 


tgtgggtgaa 


aaggagaacg 


gacctgactg 


gagccttcaa 


attggacccc 


gtgtacttaa 


agcacagcca 


ccagggctcg 


gggcttatca 


cggactacag 


gcactggcag 


ctgccactgg 


gtcggcgatt 


ccggtccctg 


aaaatgtggt 


ttgtttttag 


gatgtacgga 


gtcaagggac 


tgcaggccta 


tatccgcaag 


cacgtgcagc 


tgtctcatga 


gtttgaggca 


tttgtgcttc 


aggatccacg 


ctttgaagtc 


tgtgccgaag 


tcaccctggg 


gctggtgtgt 


ttccggctga 


agggctccga 


cggactgaat 


gaagcgcttc 


tggaaaggat 


aaacagcgcc 


aggaaaatcc 


acttggttcc 


ctgtcgcctg 


aggggccagt 


tcgtgctgcg 


gttcgccatc 


tgctcgcgca 


aggtggagtc 


gggccacgtg 


cggctggcct 


gggagcacat 


ccgagggctg 


gcggccgagc 


tgctggccgc 


ggaggaggga 


aaggcagaga 


tcaaaagttg 


a 









<210> 54 

<211> 486 

<212> PRT 

<213> Sus scrofa 

<400> 54 

Met Asn Ala Ser Asp Phe Arg Arg Arg Gly Lys Glu Met Val Asp Tyr 
1 5 10 15 



Met Ala Asp Tyr Leu Glu Gly Ile Glu Gly Arg Gln Val Tyr Pro Asp 
20 25 30 



Val Gln Pro Gly Tyr Leu Arg Pro Leu Ile Pro Ala Thr Ala Pro Gln 
35 40 45 



Glu Pro Asp Thr Phe Glu Asp Ile Leu Gln Asp Val Glu Lys Ile Ile 
50 55 60 



Met Pro Gly Val Thr His Trp His Ser Pro Tyr Phe Phe Ala Tyr Phe 
65 70 75 80 






Pro Thr Ala Ser Ser Tyr Pro Ala Met Leu Ala Asp Met Leu Cys Gly 
85 90 95 



Ala Ile Gly Cys Ile Gly Phe Ser Trp Ala Ala Ser Pro Ala Cys Thr 
100 105 110 



Glu Leu Glu Thr Val Met Met Asp Trp Leu Gly Lys Met Leu Gln Leu 
115 120 125 



Pro Glu Ala Phe Leu Ala Gly Glu Ala Gly Glu Gly Gly Gly Val Ile 
130 135 140 



Gln Gly Ser Ala Ser Glu Ala Thr Leu Val Ala Leu Leu Ala Ala Arg 
145 150 155 160 



Thr Lys Val Val Arg Arg Leu Gln Ala Ala Ser Pro Gly Leu Thr Gln 
165 170 175 



Gly Ala Val Leu Glu Lys Leu Val Ala Tyr Ala Ser Asp Gln Ala His 
180 185 190 



Ser Ser Val Glu Arg Ala Gly Leu Ile Gly Gly Val Lys Leu Lys Ala 
195 200 205 



Ile Pro Ser Asp Gly Lys Phe Ala Met Arg Ala Ser Ala Leu Gln Glu 
210 215 220 



Ala Leu Glu Arg Asp Lys Ala Ala Gly Leu Ile Pro Phe Phe Val Val 
225 230 235 240 



Ala Thr Leu Gly Thr Thr Ser Cys Cys Ser Phe Asp Asn Leu Leu Glu 
245 250 255 



Val Gly Pro Ile Cys His Glu Glu Asp Ile Trp Leu His Val Asp Ala 
260 265 270 



Ala Tyr Ala Gly Ser Ala Phe Ile Cys Pro Glu Phe Arg His Leu Leu 
275 280 285 



Asn Gly Val Glu Phe Ala Asp Ser Phe Asn Phe Asn Pro His Lys Trp 
290 295 300 



Leu Leu Val Asn Phe Asp Cys Ser Ala Met Trp Val Lys Arg Arg Thr 
305 310 315 320 



Asp Leu Thr Gly Ala Phe Lys Leu Asp Pro Val Tyr Leu Lys His Ser 
325 330 335 



His Gln Gly Ser Gly Leu Ile Thr Asp Tyr Arg His Trp Gln Leu Pro 
340 345 350 



Leu Gly Arg Arg Phe Arg Ser Leu Lys Met Trp Phe Val Phe Arg Met 
355 360 365 



Tyr Gly Val Lys Gly Leu Gin Ala Tyr lie Arg Lys His Val Gin Leu 
370 375 380 



Ser His Glu Phe Glu Ala Phe Val Leu Gin Asp Pro Arg Phe Glu Val 
385 390 395 400 



Cys Ala Glu Val Thr Leu Gly Leu Val Cys Phe Arg Leu Lys Gly Ser 
405 410 415 



Asp Gly Leu Asn Glu Ala Leu Leu Glu Arg He Asn Ser Ala Arg Lys 
420 425 430 



He His Leu Val Pro Cys Arg Leu Arg Gly Gin Phe Val Leu Arg Phe 
435 440 445 



Ala He Cys Ser Arg Lys Val Glu Ser Gly His Val Arg Leu Ala Trp 
450 455 460 



Glu His He Arg Gly Leu Ala Ala Glu Leu Leu Ala Ala Glu Glu Gly 
465 470 475 480 



Lys Ala Glu He Lys Ser 
485 



<210> 55 
<211> 1464 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic gene derived from Sus scrofa L-aromatic amino acid
decarboxylase, having numerous codons replaced with others encoding
the same amino acids to reduce free energy of folding, and a gly
codon inserted after the initiating met codon




<400> 55 
atgggtaacg cttccgattt ccgtcgtcgt ggcaaagaaa tggtagacta catggcagat 60
tatctggaag gtatcgaagg ccgtcaagtt tacccggacg ttcagccagg ctatctgcgt 120
ccgctcatcc cagctaccgc accgcaagaa ccggacacct ttgaagacat cctgcaagac 180
gtagaaaaga tcatcatgcc aggtgtaacc cactggcact ctccgtactt tttcgcatac 240
ttcccgactg catcctccta cccggctatg ctggctgaca tgctgtgtgg tgctatcggc 300
tgtatcggct tttcctgggc tgcatctccg gcatgcactg agctggaaac cgttatgatg 360
gattggctgg gtaaaatgct gcagctgcca gaggcatttc tggctggtga ggctggtgag 420
ggtggtggtg taattcaagg ctctgcgtcc gaagctactc tggttgctct gctggctgct 480
cgtactaaag ttgttcgtcg tctgcaagct gcatctccgg gtctgactca gggtgctgtt 540
ctggagaaac tggtagcgta tgcttctgat caggctcact cttccgttga gcgtgctggt 600
ctgattggtg gtgttaagct gaaagctatt ccgtccgatg gtaagttcgc tatgcgtgca 660
tccgctctgc aagaagctct ggaacgtgac aaagctgctg gtctgattcc gttcttcgtt 720
gttgctaccc tgggtactac ctcttgctgt tctttcgaca acctgctgga agttggtccg 780
atctgtcacg aggaggacat ctggctgcac gttgacgcag catatgctgg ctctgctttt 840
atctgtccgg aattccgtca cctgctgaac ggcgttgagt tcgctgattc tttcaacttc 900
aacccgcaca agtggctgct ggttaacttt gattgctcgg ctatgtgggt aaaacgtcgc 960
actgatctga ccggtgcatt taaactggac ccggtatatc tgaagcattc tcaccagggt 1020
tccggcctga ttaccgatta tcgtcattgg cagctgccgc tgggtcgtcg ttttcgttcg 1080
ctgaagatgt ggttcgtatt ccgtatgtac ggcgttaaag gtctgcaagc atacatccgt 1140
aaacacgttc aactgtcgca cgagttcgaa gctttcgtac tgcaggaccc gcgttttgaa 1200
gtttgcgctg aagttaccct gggcctggtt tgcttccgtc tgaagggttc tgatggtctg 1260
aacgaagctc tgctggaacg tattaactcg gctcgtaaaa tccacctggt tccgtgtcgt 1320
ctgcgtggtc agttcgttct gcgcttcgct atttgttcgc gtaaggtaga gtctggtcat 1380
gttcgtctgg catgggagca catccgtggt ctggctgctg aactgctggc tgctgaagaa 1440
ggtaaggctg aaatcaaatc ctaa 1464



<210> 56 

<211> 487 

<212> PRT 

<213> Artificial Sequence 






<220> 

<223> Synthetic protein derived from Sus scrofa L-aromatic amino acid
decarboxylase, having a glycine inserted at position 2 in the
amino acid sequence

<400> 56 

Met Gly Asn Ala Ser Asp Phe Arg Arg Arg Gly Lys Glu Met Val Asp
1 5 10 15

Tyr Met Ala Asp Tyr Leu Glu Gly Ile Glu Gly Arg Gln Val Tyr Pro
20 25 30

Asp Val Gln Pro Gly Tyr Leu Arg Pro Leu Ile Pro Ala Thr Ala Pro
35 40 45

Gln Glu Pro Asp Thr Phe Glu Asp Ile Leu Gln Asp Val Glu Lys Ile
50 55 60

Ile Met Pro Gly Val Thr His Trp His Ser Pro Tyr Phe Phe Ala Tyr
65 70 75 80

Phe Pro Thr Ala Ser Ser Tyr Pro Ala Met Leu Ala Asp Met Leu Cys
85 90 95

Gly Ala Ile Gly Cys Ile Gly Phe Ser Trp Ala Ala Ser Pro Ala Cys
100 105 110

Thr Glu Leu Glu Thr Val Met Met Asp Trp Leu Gly Lys Met Leu Gln
115 120 125

Leu Pro Glu Ala Phe Leu Ala Gly Glu Ala Gly Glu Gly Gly Gly Val
130 135 140

Ile Gln Gly Ser Ala Ser Glu Ala Thr Leu Val Ala Leu Leu Ala Ala
145 150 155 160

Arg Thr Lys Val Val Arg Arg Leu Gln Ala Ala Ser Pro Gly Leu Thr
165 170 175

Gln Gly Ala Val Leu Glu Lys Leu Val Ala Tyr Ala Ser Asp Gln Ala
180 185 190

His Ser Ser Val Glu Arg Ala Gly Leu Ile Gly Gly Val Lys Leu Lys
195 200 205

Ala Ile Pro Ser Asp Gly Lys Phe Ala Met Arg Ala Ser Ala Leu Gln
210 215 220

Glu Ala Leu Glu Arg Asp Lys Ala Ala Gly Leu Ile Pro Phe Phe Val
225 230 235 240

Val Ala Thr Leu Gly Thr Thr Ser Cys Cys Ser Phe Asp Asn Leu Leu
245 250 255

Glu Val Gly Pro Ile Cys His Glu Glu Asp Ile Trp Leu His Val Asp
260 265 270

Ala Ala Tyr Ala Gly Ser Ala Phe Ile Cys Pro Glu Phe Arg His Leu
275 280 285

Leu Asn Gly Val Glu Phe Ala Asp Ser Phe Asn Phe Asn Pro His Lys
290 295 300

Trp Leu Leu Val Asn Phe Asp Cys Ser Ala Met Trp Val Lys Arg Arg
305 310 315 320

Thr Asp Leu Thr Gly Ala Phe Lys Leu Asp Pro Val Tyr Leu Lys His
325 330 335

Ser His Gln Gly Ser Gly Leu Ile Thr Asp Tyr Arg His Trp Gln Leu
340 345 350

Pro Leu Gly Arg Arg Phe Arg Ser Leu Lys Met Trp Phe Val Phe Arg
355 360 365

Met Tyr Gly Val Lys Gly Leu Gln Ala Tyr Ile Arg Lys His Val Gln
370 375 380

Leu Ser His Glu Phe Glu Ala Phe Val Leu Gln Asp Pro Arg Phe Glu
385 390 395 400

Val Cys Ala Glu Val Thr Leu Gly Leu Val Cys Phe Arg Leu Lys Gly
405 410 415

Ser Asp Gly Leu Asn Glu Ala Leu Leu Glu Arg Ile Asn Ser Ala Arg
420 425 430

Lys Ile His Leu Val Pro Cys Arg Leu Arg Gly Gln Phe Val Leu Arg
435 440 445

Phe Ala Ile Cys Ser Arg Lys Val Glu Ser Gly His Val Arg Leu Ala
450 455 460

Trp Glu His Ile Arg Gly Leu Ala Ala Glu Leu Leu Ala Ala Glu Glu
465 470 475 480

Gly Lys Ala Glu Ile Lys Ser
485

<210> 57 
<211> 1098 
<212> DNA 

<213> Candida boidinii 



<400> 57 
atgggtaaga ttgtcttagt tctttatgat gctggtaagc acgctgctga tgaagaaaaa 60
ttatatggtt gtactgaaaa taaattaggt attgctaatt ggttaaaaga tcaaggtcat 120
[nucleotides 121-180 illegible in source] 180
gatgctgata ttatcatcac cactcctttc catcctgctt atatcactaa ggaaagactt 240
gacaaggcta agaacttaaa attagtcgtt gtcgctggtg ttggttctga tcacattgat 300
ttagattata ttaatcaaac aggtaagaaa atctcagtcc tggaagttac aggttctaat 360
gttgtctctg ttgctgaaca cgttgtcatg accatgcttg tcttggttag aaatttcgtt 420
ccagcacatg aacaaattat taaccacgat tgggaggttg ctgctatcgc taaggatgct 480
tacgatatcg aaggtaaaac tatcgctacc attggtgctg gtagaattgg ttacagagtc 540
ttggaaagat tactcccatt taatccaaaa gaattattat actacgatta tcaagcttta 600
ccaaaagaag ctgaagaaaa agttggtgct agaagagttg aaaatattga agaattagtt 660
gctcaagctg atatcgttac agttaatgct ccattacacg caggtacaaa aggtttaatt 720
aataaggaat tattatctaa atttaaaaaa ggtgcttggt tagtcaatac cgcaagaggt 780
gctatttgtg ttgctgaaga tgttgcagca gctttagaat ctggtcaatt aagaggttac 840
ggtggtgatg tttggttccc acaaccagct ccaaaggatc acccatggag agatatgaga 900
aataaatatg gtgctggtaa tgccatgact cctcactact ctggtactac tttagacgct 960
caaacaagat acgctgaagg tactaaaaat attttggaat cattctttac cggtaaattt 1020
gattacagac cacaagatat tatcttatta aatggtgaat acgttactaa agcttacggt 1080
aaacacgata agaaataa 1098



<210> 58 

<211> 365 

<212> PRT 

<213> Candida boidinii 

<400> 58 

Met Gly Lys Ile Val Leu Val Leu Tyr Asp Ala Gly Lys His Ala Ala
1 5 10 15

Asp Glu Glu Lys Leu Tyr Gly Cys Thr Glu Asn Lys Leu Gly Ile Ala
20 25 30

Asn Trp Leu Lys Asp Gln Gly His Glu Leu Ile Thr Thr Ser Asp Lys
35 40 45

Glu Gly Glu Thr Ser Glu Leu Asp Lys His Ile Pro Asp Ala Asp Ile
50 55 60

Ile Ile Thr Thr Pro Phe His Pro Ala Tyr Ile Thr Lys Glu Arg Leu
65 70 75 80

Asp Lys Ala Lys Asn Leu Lys Leu Val Val Val Ala Gly Val Gly Ser
85 90 95

Asp His Ile Asp Leu Asp Tyr Ile Asn Gln Thr Gly Lys Lys Ile Ser
100 105 110

Val Leu Glu Val Thr Gly Ser Asn Val Val Ser Val Ala Glu His Val
115 120 125

Val Met Thr Met Leu Val Leu Val Arg Asn Phe Val Pro Ala His Glu
130 135 140

Gln Ile Ile Asn His Asp Trp Glu Val Ala Ala Ile Ala Lys Asp Ala
145 150 155 160

Tyr Asp Ile Glu Gly Lys Thr Ile Ala Thr Ile Gly Ala Gly Arg Ile
165 170 175

Gly Tyr Arg Val Leu Glu Arg Leu Leu Pro Phe Asn Pro Lys Glu Leu
180 185 190

Leu Tyr Tyr Asp Tyr Gln Ala Leu Pro Lys Glu Ala Glu Glu Lys Val
195 200 205

Gly Ala Arg Arg Val Glu Asn Ile Glu Glu Leu Val Ala Gln Ala Asp
210 215 220

Ile Val Thr Val Asn Ala Pro Leu His Ala Gly Thr Lys Gly Leu Ile
225 230 235 240

Asn Lys Glu Leu Leu Ser Lys Phe Lys Lys Gly Ala Trp Leu Val Asn
245 250 255

Thr Ala Arg Gly Ala Ile Cys Val Ala Glu Asp Val Ala Ala Ala Leu
260 265 270

Glu Ser Gly Gln Leu Arg Gly Tyr Gly Gly Asp Val Trp Phe Pro Gln
275 280 285

Pro Ala Pro Lys Asp His Pro Trp Arg Asp Met Arg Asn Lys Tyr Gly
290 295 300

Ala Gly Asn Ala Met Thr Pro His Tyr Ser Gly Thr Thr Leu Asp Ala
305 310 315 320

Gln Thr Arg Tyr Ala Glu Gly Thr Lys Asn Ile Leu Glu Ser Phe Phe
325 330 335

Thr Gly Lys Phe Asp Tyr Arg Pro Gln Asp Ile Ile Leu Leu Asn Gly
340 345 350

Glu Tyr Val Thr Lys Ala Tyr Gly Lys His Asp Lys Lys
355 360 365



<210> 59 
<211> 1098 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic gene derived from Candida boidinii formate
dehydrogenase, having numerous codons replaced with others encoding
the same amino acids to reduce free energy of folding, and a gly
codon inserted after the initiating met codon to insert a
restriction site

<400> 59



atgggcaaaa tcgttctggt tctgtatgac gctggtaaac acgctgctga cgaagaaaaa 60
ctgtacggct gcaccgaaaa caaactgggt atcgctaact ggctgaaaga tcagggtcac 120
gaactgatca ctacctctga caaagaaggt gaaacctctg aactggacaa acacatcccg 180
gatgcagata tcatcatcac cactccgttc cacccggctt acatcaccaa agagcgtctg 240
gacaaagcta aaaacctgaa actggtagta gttgctggtg taggttctga ccacatcgac 300
ctggactaca tcaaccagac tggtaaaaaa atctctgtac tggaagtaac tggttctaac 360
gttgtttctg ttgctgaaca cgttgtaatg actatgctgg ttctggttcg taacttcgtt 420
ccggctcacg aacagatcat caaccacgat tgggaagttg cagcaatcgc taaagacgct 480
tatgacatcg aaggcaaaac catcgctact atcggcgctg gccgtatcgg ttaccgtgtt 540
ctggaacgtc tgctgccgtt caacccgaaa gaactgctgt actacgacta ccaggctctg 600
ccgaaagaag cagaggagaa agttggtgct cgccgtgtag agaacatcga agagctggta 660
gctcaggctg acatcgttac tgttaacgct ccgctgcacg caggcactaa aggtctgatt 720
aacaaagagc tgctgtctaa attcaaaaaa ggtgcatggc tggttaacac tgcacgtggt 780
gctatctgcg ttgctgaaga cgttgctgct gcactggaat ctggtcagct gcgtggttac 840
ggtggtgacg tatggtttcc gcagccggct ccgaaagatc acccgtggcg tgatatgcgt 900
aacaaatatg gcgctggtaa cgcaatgacc ccgcactact ctggtaccac tctggatgct 960
cagacccgtt acgctgaagg tactaaaaac atcctggaat ctttcttcac tggtaaattc 1020
gactaccgcc cgcaggacat cattctgctg aacggtgaat atgtaactaa agcttacggc 1080
aaacacgaca aaaaataa 1098



<210> 60 

<211> 365 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic protein derived from Candida boidinii formate
dehydrogenase, having a glycine inserted after the initiating methionine

<400> 60 

Met Gly Lys Ile Val Leu Val Leu Tyr Asp Ala Gly Lys His Ala Ala
1 5 10 15

Asp Glu Glu Lys Leu Tyr Gly Cys Thr Glu Asn Lys Leu Gly Ile Ala
20 25 30

Asn Trp Leu Lys Asp Gln Gly His Glu Leu Ile Thr Thr Ser Asp Lys
35 40 45

Glu Gly Glu Thr Ser Glu Leu Asp Lys His Ile Pro Asp Ala Asp Ile
50 55 60

Ile Ile Thr Thr Pro Phe His Pro Ala Tyr Ile Thr Lys Glu Arg Leu
65 70 75 80

Asp Lys Ala Lys Asn Leu Lys Leu Val Val Val Ala Gly Val Gly Ser
85 90 95

Asp His Ile Asp Leu Asp Tyr Ile Asn Gln Thr Gly Lys Lys Ile Ser
100 105 110

Val Leu Glu Val Thr Gly Ser Asn Val Val Ser Val Ala Glu His Val
115 120 125

Val Met Thr Met Leu Val Leu Val Arg Asn Phe Val Pro Ala His Glu
130 135 140

Gln Ile Ile Asn His Asp Trp Glu Val Ala Ala Ile Ala Lys Asp Ala
145 150 155 160

Tyr Asp Ile Glu Gly Lys Thr Ile Ala Thr Ile Gly Ala Gly Arg Ile
165 170 175

Gly Tyr Arg Val Leu Glu Arg Leu Leu Pro Phe Asn Pro Lys Glu Leu
180 185 190

Leu Tyr Tyr Asp Tyr Gln Ala Leu Pro Lys Glu Ala Glu Glu Lys Val
195 200 205

Gly Ala Arg Arg Val Glu Asn Ile Glu Glu Leu Val Ala Gln Ala Asp
210 215 220

Ile Val Thr Val Asn Ala Pro Leu His Ala Gly Thr Lys Gly Leu Ile
225 230 235 240

Asn Lys Glu Leu Leu Ser Lys Phe Lys Lys Gly Ala Trp Leu Val Asn
245 250 255

Thr Ala Arg Gly Ala Ile Cys Val Ala Glu Asp Val Ala Ala Ala Leu
260 265 270

Glu Ser Gly Gln Leu Arg Gly Tyr Gly Gly Asp Val Trp Phe Pro Gln
275 280 285

Pro Ala Pro Lys Asp His Pro Trp Arg Asp Met Arg Asn Lys Tyr Gly
290 295 300

Ala Gly Asn Ala Met Thr Pro His Tyr Ser Gly Thr Thr Leu Asp Ala
305 310 315 320

Gln Thr Arg Tyr Ala Glu Gly Thr Lys Asn Ile Leu Glu Ser Phe Phe
325 330 335

Thr Gly Lys Phe Asp Tyr Arg Pro Gln Asp Ile Ile Leu Leu Asn Gly
340 345 350

Glu Tyr Val Thr Lys Ala Tyr Gly Lys His Asp Lys Lys
355 360 365



<210> 61 
<211> 1488 
<212> DNA 

<213> Pseudomonas putida 
<400> 61 

atgtccctgt tgatccgtgg cgccaccgtg gtcacccacg aagagagtta ccccgccgat 60 

gtcctgtgtg tcgatggcct gatccgtgcc atcgggccaa acctcgaacc gcccaccgac 120 

tgtgaaatcc tcgacggcag cggccagtac ctgatgcccg gcggcatcga cccgcatacc 180 

cacatgcagt tgccattcat gggcaccgtg gccagcgagg atttcttcag cggcaccgca 240 

gcgggccttg ccggcggcac cacgtcgatc atcgacttcg tcatccccaa cccgcagcag 300 

tcattgctgg aggccttcca cacctggcgc ggctgggcgc agaagagcgc cagcgactac 360 

ggcttccacg ttgccatcac ctggtggagc gaacaggtgg ctgaagaaat gggcgaactg 420 

gtagccaagc atggggtgaa cagcttcaag cacttcatgg cttacaagaa tgcaatcatg 480 

gccgccgacg acaccctggt ggccagcttc gagcgctgcc tgcaactggg tgccgtgccc 540 

accgtgcatg ccgagaacgg cgaactggtg taccacctgc agaaaaaact gcttgcccag 600 

ggcatgaccg gaccagaggc tcaccccctt tcgcgccctt cacaagtgga aggtgaagcg 660 



gccagccgcg ccatccgtat tgccgaaacc attggtacgc cgctgtatgt ggtgcacatt 720

tccagccgtg aagcactgga tgaaatcacc tatgcacgcg ccaagggcca gccggtttac 780

ggcgaagtct tgcccggcca cctgctgctg gacgacagcg tctaccgtga cccggactgg 840

gccactgccg ctggctacgt gatgagcccg ccgttccgcc cgcgcgagca ccaggaggcg 900

ctgtggcgcg gcttgcagtc gggcaacctg cacaccacgg ccaccgacca ctgctgtttc 960

tgcgccgaac agaaagccat gggccgcgac gacttcagtc gcatccccaa cggcaccgcc 1020

ggcatcgaag accgcatggc ggtgctgtgg gatgccggtg tcaacagcgg gcgcctgtcg 1080

atgcatgagt tcgttgcgct gacctccacc aacacggcaa aaatcttcaa ccttttccca 1140

cgcaagggcg ccatccgcgt gggtgccgac gccgacctgg tgctgtggga cccgcagggc 1200

actcgcactc tatcggccca gacccaccac cagcgggtgg acttcaatat ctttgaaggc 1260

cgcactgtgc gcggggtccc cagccacacc atcagccagg gcaaggtgct ctgggccgat 1320

ggcgacctgc gtcgccgagg ccggggcggg gcggtatgtg gaacggccgg cgtatccgtc 1380

ggtgtacgag gtgctggggc gacgcgccga acagcagcgc ccgacgcccg ttcagcgctg 1440

aggccattgg ggctgctgcg cagcccatcg ccggcaagcc aaatataa 1488



<210> 62 
<211> 495 
<212> PRT 

<213> Pseudomonas putida 
<400> 62 

Met Ser Leu Leu Ile Arg Gly Ala Thr Val Val Thr His Glu Glu Ser
1 5 10 15

Tyr Pro Ala Asp Val Leu Cys Val Asp Gly Leu Ile Arg Ala Ile Gly
20 25 30

Pro Asn Leu Glu Pro Pro Thr Asp Cys Glu Ile Leu Asp Gly Ser Gly
35 40 45

Gln Tyr Leu Met Pro Gly Gly Ile Asp Pro His Thr His Met Gln Leu
50 55 60

Pro Phe Met Gly Thr Val Ala Ser Glu Asp Phe Phe Ser Gly Thr Ala
65 70 75 80

Ala Gly Leu Ala Gly Gly Thr Thr Ser Ile Ile Asp Phe Val Ile Pro
85 90 95

Asn Pro Gln Gln Ser Leu Leu Glu Ala Phe His Thr Trp Arg Gly Trp
100 105 110

Ala Gln Lys Ser Ala Ser Asp Tyr Gly Phe His Val Ala Ile Thr Trp
115 120 125

Trp Ser Glu Gln Val Ala Glu Glu Met Gly Glu Leu Val Ala Lys His
130 135 140

Gly Val Asn Ser Phe Lys His Phe Met Ala Tyr Lys Asn Ala Ile Met
145 150 155 160

Ala Ala Asp Asp Thr Leu Val Ala Ser Phe Glu Arg Cys Leu Gln Leu
165 170 175

Gly Ala Val Pro Thr Val His Ala Glu Asn Gly Glu Leu Val Tyr His
180 185 190

Leu Gln Lys Lys Leu Leu Ala Gln Gly Met Thr Gly Pro Glu Ala His
195 200 205

Pro Leu Ser Arg Pro Ser Gln Val Glu Gly Glu Ala Ala Ser Arg Ala
210 215 220

Ile Arg Ile Ala Glu Thr Ile Gly Thr Pro Leu Tyr Val Val His Ile
225 230 235 240

Ser Ser Arg Glu Ala Leu Asp Glu Ile Thr Tyr Ala Arg Ala Lys Gly
245 250 255

Gln Pro Val Tyr Gly Glu Val Leu Pro Gly His Leu Leu Leu Asp Asp
260 265 270

Ser Val Tyr Arg Asp Pro Asp Trp Ala Thr Ala Ala Gly Tyr Val Met
275 280 285

Ser Pro Pro Phe Arg Pro Arg Glu His Gln Glu Ala Leu Trp Arg Gly
290 295 300

Leu Gln Ser Gly Asn Leu His Thr Thr Ala Thr Asp His Cys Cys Phe
305 310 315 320

Cys Ala Glu Gln Lys Ala Met Gly Arg Asp Asp Phe Ser Arg Ile Pro
325 330 335

Asn Gly Thr Ala Gly Ile Glu Asp Arg Met Ala Val Leu Trp Asp Ala
340 345 350

Gly Val Asn Ser Gly Arg Leu Ser Met His Glu Phe Val Ala Leu Thr
355 360 365

Ser Thr Asn Thr Ala Lys Ile Phe Asn Leu Phe Pro Arg Lys Gly Ala
370 375 380

Ile Arg Val Gly Ala Asp Ala Asp Leu Val Leu Trp Asp Pro Gln Gly
385 390 395 400

Thr Arg Thr Leu Ser Ala Gln Thr His His Gln Arg Val Asp Phe Asn
405 410 415

Ile Phe Glu Gly Arg Thr Val Arg Gly Val Pro Ser His Thr Ile Ser
420 425 430

Gln Gly Lys Val Leu Trp Ala Asp Gly Asp Leu Arg Arg Arg Gly Arg
435 440 445

Gly Gly Ala Val Cys Gly Thr Ala Gly Val Ser Val Gly Val Arg Gly
450 455 460

Ala Gly Ala Thr Arg Arg Thr Ala Ala Pro Asp Ala Arg Ser Ala Leu
465 470 475 480

Arg Pro Leu Gly Leu Leu Arg Ser Pro Ser Pro Ala Ser Gln Ile
485 490 495



<210> 63 
<211> 1491 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic gene derived from Pseudomonas putida hydantoinase,
having numerous codons replaced with others encoding the same amino
acids to reduce free energy of folding, and a gly codon inserted
after the initiating met codon to insert a restriction site




<400> 63 
atgggctctc tgctgatccg tggtgctacc gttgttaccc acgaagaatc ttatccggct 60
gacgttctgt gcgttgacgg tctgatccgt gctatcggtc cgaacctgga accgccgacc 120
gactgcgaaa tcctggacgg ttctggtcag tacctgatgc cgggtggtat cgacccgcat 180
actcacatgc agctgccgtt tatgggtact gttgcttctg aagacttctt ctctggcacc 240
gctgctggtc tggctggtgg taccacctct atcatcgact tcgttatccc gaacccgcag 300
cagtctctgc tggaagcttt ccatacttgg cgtggttggg ctcagaaatc tgcatctgac 360
tacggtttcc acgttgctat cacctggtgg tctgaacagg ttgctgaaga aatgggcgaa 420
ctggttgcta aacacggtgt taactctttc aaacacttca tggcttacaa aaacgcaatt 480
atggcggctg acgacactct ggttgcttct ttcgaacgct gtctgcagct gggcgctgtt 540
ccgaccgttc acgctgaaaa cggcgagctg gtttatcacc tgcagaaaaa actgctggct 600
cagggtatga ctggcccgga agctcacccg ctgtctcgtc cgtctcaggt tgagggcgaa 660
gctgcttctc gtgctatccg tatcgctgaa accatcggta ccccgctgta tgtagttcat 720
atctcttctc gtgaagctct ggatgagatt acttacgcac gtgctaaggg tcagccggtt 780
tacggtgaag ttctgccggg tcatctgctg ctggatgatt ctgtataccg cgatccggac 840
tgggcaactg ctgctggtta cgttatgtcc ccgccgtttc gtccgcgtga gcatcaggag 900
gcactgtggc gcggcctgca gtctggtaac ctgcatacta ctgctactga tcactgttgt 960
ttctgcgctg agcagaaggc tatgggtcgc gatgacttct ctcgcattcc gaacggtact 1020
gctggcattg aggaccgtat ggctgttctg tgggatgctg gcgttaactc tggtcgtctg 1080
tctatgcacg aattcgttgc tctgacctct actaacactg ctaaaatctt caacctgttc 1140
ccgcgtaaag gtgcaatccg cgtaggtgca gatgctgatc tggttctgtg ggatccgcag 1200
ggcactcgca ctctgtctgc tcagactcat catcagcgtg ttgacttcaa catctttgag 1260
ggccgtactg ttcgcggtgt tccgtctcat accatctctc agggtaaagt tctgtgggct 1320
gacggtgacc tgcgtcgtcg tggtcgtggt ggtgctgttt gcggtaccgc tggtgtttct 1380
gttggtgttc gtggcgctgg tgctacccgt cgtactgctg ctccggatgc tcgttctgct 1440
ctgcgtccgc tgggtctgct gcgttctccg tctccggctt ctcagattta a 1491



<210> 64 

<211> 496 

<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Synthetic protein derived from Pseudomonas putida hydantoinase,
having a glycine residue inserted after the initiating methionine

<400> 64 

Met Gly Ser Leu Leu Ile Arg Gly Ala Thr Val Val Thr His Glu Glu
1 5 10 15

Ser Tyr Pro Ala Asp Val Leu Cys Val Asp Gly Leu Ile Arg Ala Ile
20 25 30

Gly Pro Asn Leu Glu Pro Pro Thr Asp Cys Glu Ile Leu Asp Gly Ser
35 40 45

Gly Gln Tyr Leu Met Pro Gly Gly Ile Asp Pro His Thr His Met Gln
50 55 60

Leu Pro Phe Met Gly Thr Val Ala Ser Glu Asp Phe Phe Ser Gly Thr
65 70 75 80

Ala Ala Gly Leu Ala Gly Gly Thr Thr Ser Ile Ile Asp Phe Val Ile
85 90 95

Pro Asn Pro Gln Gln Ser Leu Leu Glu Ala Phe His Thr Trp Arg Gly
100 105 110

Trp Ala Gln Lys Ser Ala Ser Asp Tyr Gly Phe His Val Ala Ile Thr
115 120 125

Trp Trp Ser Glu Gln Val Ala Glu Glu Met Gly Glu Leu Val Ala Lys
130 135 140

His Gly Val Asn Ser Phe Lys His Phe Met Ala Tyr Lys Asn Ala Ile
145 150 155 160

Met Ala Ala Asp Asp Thr Leu Val Ala Ser Phe Glu Arg Cys Leu Gln
165 170 175

Leu Gly Ala Val Pro Thr Val His Ala Glu Asn Gly Glu Leu Val Tyr
180 185 190

His Leu Gln Lys Lys Leu Leu Ala Gln Gly Met Thr Gly Pro Glu Ala
195 200 205

His Pro Leu Ser Arg Pro Ser Gln Val Glu Gly Glu Ala Ala Ser Arg
210 215 220

Ala Ile Arg Ile Ala Glu Thr Ile Gly Thr Pro Leu Tyr Val Val His
225 230 235 240

Ile Ser Ser Arg Glu Ala Leu Asp Glu Ile Thr Tyr Ala Arg Ala Lys
245 250 255

Gly Gln Pro Val Tyr Gly Glu Val Leu Pro Gly His Leu Leu Leu Asp
260 265 270

Asp Ser Val Tyr Arg Asp Pro Asp Trp Ala Thr Ala Ala Gly Tyr Val
275 280 285

Met Ser Pro Pro Phe Arg Pro Arg Glu His Gln Glu Ala Leu Trp Arg
290 295 300

Gly Leu Gln Ser Gly Asn Leu His Thr Thr Ala Thr Asp His Cys Cys
305 310 315 320

Phe Cys Ala Glu Gln Lys Ala Met Gly Arg Asp Asp Phe Ser Arg Ile
325 330 335

Pro Asn Gly Thr Ala Gly Ile Glu Asp Arg Met Ala Val Leu Trp Asp
340 345 350

Ala Gly Val Asn Ser Gly Arg Leu Ser Met His Glu Phe Val Ala Leu
355 360 365

Thr Ser Thr Asn Thr Ala Lys Ile Phe Asn Leu Phe Pro Arg Lys Gly
370 375 380

Ala Ile Arg Val Gly Ala Asp Ala Asp Leu Val Leu Trp Asp Pro Gln
385 390 395 400

Gly Thr Arg Thr Leu Ser Ala Gln Thr His His Gln Arg Val Asp Phe
405 410 415

Asn Ile Phe Glu Gly Arg Thr Val Arg Gly Val Pro Ser His Thr Ile
420 425 430

Ser Gln Gly Lys Val Leu Trp Ala Asp Gly Asp Leu Arg Arg Arg Gly
435 440 445

Arg Gly Gly Ala Val Cys Gly Thr Ala Gly Val Ser Val Gly Val Arg
450 455 460

Gly Ala Gly Ala Thr Arg Arg Thr Ala Ala Pro Asp Ala Arg Ser Ala
465 470 475 480

Leu Arg Pro Leu Gly Leu Leu Arg Ser Pro Ser Pro Ala Ser Gln Ile
485 490 495



<210> 65 
<211> 1683 
<212> DNA 

<213> Penicillium simplicissimum 
<400> 65 

atgtccaaga cacaggaatt caggcctttg acactgccac ccaagctgtc gttaagtgac 60 

ttcaatgaat tcatccagga tattattcga atcgttggct ctgaaaatgt tgaagtcatt 120 

agctcgaagg accagattgt tgacggttct tatatgaaac ctacgcacac gcacgatccc 180 

catcatgtca tggaccagga ctacttcctt gcctcagcaa ttgttgctcc tcgcaatgtc 240 

gccgatgtgc agtcgattgt cggacttgcc aataagttct catttcccct ctggcccatc 300

tctattggaa gaaattccgg atatggcggt gctgcgccac gggttagtgg cagtgtcgtg 360

ctggacatgg gaaagaatat gaacagagtt ctagaagtga acgtggaagg cgcatattgc 420 

gtggtggagc ccggtgtaac ttaccacgac ttgcataatt accttgaggc gaacaatctt 480

cgagacaaat tatggcttga tgtaccggat cttggtggcg gttctgttct cggcaatgcc 540 

gttgagagag gtgtgggcta tacgccttac ggagatcatt ggatgatgca cagtgggatg 600 

gaagtcgtcc ttgcgaatgg cgagcttctt aggactggca tgggggctct acctgatcct 660 

aaacgtcccg aaacgatggg gctaaagcca gaagaccagc catggagcaa aatcgctcat 720 

ctgtttcctt atggcttcgg tccctatata gatgggctat tcagccaatc gaatatggga 780 

attgttacca agatcgggat ctggttaatg cccaatccag ggggttatca atcctacttg 840 

atcacactac ccaaagatgg tgatttaaaa caagccgtcg atattattcg tccccttcgt 900 

ctaggcatgg cccttcaaaa tgttcccact attcgccaca ttcttttgga tgcagcggtg 960 

ctcggtgaca agcgatctta ttcatccaag accgaacccc tctccgacga ggaattagac 1020 







aagatcgcga aacagctcaa cttgggacga tggaactttt acggggcgct ctatggacct 1080

gagccgattc gaagggttct ctgggaaacg attaaagacg cattctcggc gatcccaggc 1140

gtcaagtttt attttccgga ggacactcct gaaaactccg ttctccgcgt gcgtgataag 1200

actatgcaag gcattccaac ttacgacgag ctaaagtgga tcgattggct ccctaatggt 1260

gcgcatctgt tcttctctcc tattgcgaag gtatctggtg aagatgcaat gatgcaatac 1320

gcagtcacca agaaaaggtg tcaggaggct gggttagatt ttatcggcac tttcacagtc 1380

ggtatgagag agatgcatca tatcgtttgt attgtgttca acaagaagga cctaatacaa 1440

aagagaaaag tacagtggct gatgagaacc cttattgatg actgtgctgc aaatggatgg 1500

ggcgaatatc gaacccatct ggccttcatg gaccaaatta tggaaaccta caactggaac 1560

aacagcagct tcctaaggtt caatgaggtc ctcaagaatg cggtggatcc taatggcatc 1620

attgccccgg gaaagtctgg tgtttggccg agtcaataca gtcatgttac ttggaaactg 1680

taa 1683



<210> 66 
<211> 560 
<212> PRT 

<213> Penicillium simplicissimum 
<400> 66 

Met Ser Lys Thr Gln Glu Phe Arg Pro Leu Thr Leu Pro Pro Lys Leu
1 5 10 15

Ser Leu Ser Asp Phe Asn Glu Phe Ile Gln Asp Ile Ile Arg Ile Val
20 25 30

Gly Ser Glu Asn Val Glu Val Ile Ser Ser Lys Asp Gln Ile Val Asp
35 40 45

Gly Ser Tyr Met Lys Pro Thr His Thr His Asp Pro His His Val Met
50 55 60

Asp Gln Asp Tyr Phe Leu Ala Ser Ala Ile Val Ala Pro Arg Asn Val
65 70 75 80

Ala Asp Val Gln Ser Ile Val Gly Leu Ala Asn Lys Phe Ser Phe Pro
85 90 95

Leu Trp Pro Ile Ser Ile Gly Arg Asn Ser Gly Tyr Gly Gly Ala Ala
100 105 110

Pro Arg Val Ser Gly Ser Val Val Leu Asp Met Gly Lys Asn Met Asn
115 120 125

Arg Val Leu Glu Val Asn Val Glu Gly Ala Tyr Cys Val Val Glu Pro
130 135 140

Gly Val Thr Tyr His Asp Leu His Asn Tyr Leu Glu Ala Asn Asn Leu
145 150 155 160

Arg Asp Lys Leu Trp Leu Asp Val Pro Asp Leu Gly Gly Gly Ser Val
165 170 175

Leu Gly Asn Ala Val Glu Arg Gly Val Gly Tyr Thr Pro Tyr Gly Asp
180 185 190

His Trp Met Met His Ser Gly Met Glu Val Val Leu Ala Asn Gly Glu
195 200 205

Leu Leu Arg Thr Gly Met Gly Ala Leu Pro Asp Pro Lys Arg Pro Glu
210 215 220

Thr Met Gly Leu Lys Pro Glu Asp Gln Pro Trp Ser Lys Ile Ala His
225 230 235 240

Leu Phe Pro Tyr Gly Phe Gly Pro Tyr Ile Asp Gly Leu Phe Ser Gln
245 250 255

Ser Asn Met Gly Ile Val Thr Lys Ile Gly Ile Trp Leu Met Pro Asn
260 265 270

Pro Gly Gly Tyr Gln Ser Tyr Leu Ile Thr Leu Pro Lys Asp Gly Asp
275 280 285

Leu Lys Gln Ala Val Asp Ile Ile Arg Pro Leu Arg Leu Gly Met Ala
290 295 300

Leu Gln Asn Val Pro Thr Ile Arg His Ile Leu Leu Asp Ala Ala Val
305 310 315 320

Leu Gly Asp Lys Arg Ser Tyr Ser Ser Lys Thr Glu Pro Leu Ser Asp
325 330 335

Glu Glu Leu Asp Lys Ile Ala Lys Gln Leu Asn Leu Gly Arg Trp Asn
340 345 350

Phe Tyr Gly Ala Leu Tyr Gly Pro Glu Pro Ile Arg Arg Val Leu Trp
355 360 365

Glu Thr Ile Lys Asp Ala Phe Ser Ala Ile Pro Gly Val Lys Phe Tyr
370 375 380

Phe Pro Glu Asp Thr Pro Glu Asn Ser Val Leu Arg Val Arg Asp Lys
385 390 395 400

Thr Met Gln Gly Ile Pro Thr Tyr Asp Glu Leu Lys Trp Ile Asp Trp
405 410 415

Leu Pro Asn Gly Ala His Leu Phe Phe Ser Pro Ile Ala Lys Val Ser
420 425 430

Gly Glu Asp Ala Met Met Gln Tyr Ala Val Thr Lys Lys Arg Cys Gln
435 440 445

Glu Ala Gly Leu Asp Phe Ile Gly Thr Phe Thr Val Gly Met Arg Glu
450 455 460

Met His His Ile Val Cys Ile Val Phe Asn Lys Lys Asp Leu Ile Gln
465 470 475 480

Lys Arg Lys Val Gln Trp Leu Met Arg Thr Leu Ile Asp Asp Cys Ala
485 490 495

Ala Asn Gly Trp Gly Glu Tyr Arg Thr His Leu Ala Phe Met Asp Gln
500 505 510

Ile Met Glu Thr Tyr Asn Trp Asn Asn Ser Ser Phe Leu Arg Phe Asn
515 520 525

Glu Val Leu Lys Asn Ala Val Asp Pro Asn Gly Ile Ile Ala Pro Gly
530 535 540

Lys Ser Gly Val Trp Pro Ser Gln Tyr Ser His Val Thr Trp Lys Leu
545 550 555 560



<210> 67 
<211> 1686 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic gene derived from Penicillium simplicissimum vanillyl
alcohol oxidase, having numerous codons replaced with others encoding
the same amino acids to reduce free energy of folding, and a gly
codon inserted after the initiating met codon to insert a
restriction site



<400> 67 



atgggctcta aaactcagga gttccgtccg ctgaccctgc cgccgaaact gtctctgtct 60
gattttaacg aattcatcca ggatatcatc cgtatcgttg gttctgaaaa cgttgaagtt 120
atctcttcta aagaccagat cgttgacggt tcttacatga aaccgaccca cacccacgac 180
ccgcaccacg ttatggacca ggactacttc ctggcttctg ctatcgttgc tccgcgtaac 240
gttgctgacg ttcagtctat cgttggtctg gctaacaaat tctctttccc gctgtggccg 300
atctctatcg gtcgtaactc tggttacggt ggtgctgctc cgcgtgtttc tggttctgtt 360
gttctggaca tgggtaaaaa catgaaccgt gttctggaag ttaacgttga aggtgcttac 420
tgcgttgttg aaccgggtgt aacttatcat gacctgcaca actacctgga agctaacaac 480
ctgcgtgaca aactgtggct ggacgtaccg gatctgggtg gtggttctgt tctgggtaac 540
gctgttgaac gtggtgttgg ttacaccccg tacggtgatc attggatgat gcactctggc 600
atggaggtag tactggctaa cggtgaactg ctgcgtaccg gtatgggtgc tctgccggac 660
ccgaagcgtc cggaaactat gggtctgaag ccggaggatc agccgtggtc taaaatcgct 720
catctgttcc cgtatggctt tggtccgtac atcgacggtc tgttctctca gtctaacatg 780
ggtatcgtta ccaaaattgg catttggctg atgccgaacc cgggtggtta ccagtcttac 840
ctgattactc tgccgaaaga tggcgacctg aaacaggctg ttgatatcat tcgcccgctg 900
cgtctgggta tggctctgca gaacgttccg actatccgcc acatcctgct ggacgctgca 960
gtactgggtg acaaacgttc ctactcctct aaaactgaac cgctgtctga cgaagaactg 1020
gacaaaatcg ctaaacagct gaacctgggt cgttggaact tctacggtgc tctgtacggt 1080
ccggaaccga tccgtcgtgt tctgtgggag actatcaagg atgctttctc tgctatcccg 1140
ggtgttaaat tctacttccc ggaagacact ccggaaaact ctgttctgcg tgtacgtgac 1200
aaaaccatgc agggtatccc gacctacgac gaactgaaat ggatcgactg gctgccgaac 1260
ggtgctcacc tgttcttttc tccgatcgct aaagtatccg gagaggacgc tatgatgcag 1320
tatgctgtta ccaaaaaacg ttgtcaggaa gctggtctgg atttcattgg taccttcact 1380
gtaggtatgc gcgaaatgca tcatattgtt tgcatcgttt tcaacaaaaa agacctgatt 1440
cagaagcgca aagttcagtg gctgatgcgt accctgatcg acgactgtgc tgctaacggt 1500
tggggtgaat accgtaccca cctggcattc atggaccaga tcatggaaac ctacaactgg 1560
aacaactctt ctttcctgcg tttcaacgaa gttctgaaaa acgctgttga cccgaacggt 1620
atcatcgctc cgggtaaatc tggtgtttgg ccgtctcagt actctcacgt tacctggaaa 1680
ctgtaa 1686



<210> 68 

<211> 561 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic protein derived from Penicillium simplicissimum vanillyl alcohol oxidase, having a glycine residue inserted after the initiating methionine

<400> 68



Met Gly Ser Lys Thr Gln Glu Phe Arg Pro Leu Thr Leu Pro Pro Lys
1 5 10 15

Leu Ser Leu Ser Asp Phe Asn Glu Phe Ile Gln Asp Ile Ile Arg Ile
20 25 30

Val Gly Ser Glu Asn Val Glu Val Ile Ser Ser Lys Asp Gln Ile Val
35 40 45

Asp Gly Ser Tyr Met Lys Pro Thr His Thr His Asp Pro His His Val
50 55 60

Met Asp Gln Asp Tyr Phe Leu Ala Ser Ala Ile Val Ala Pro Arg Asn
65 70 75 80

Val Ala Asp Val Gln Ser Ile Val Gly Leu Ala Asn Lys Phe Ser Phe
85 90 95

Pro Leu Trp Pro Ile Ser Ile Gly Arg Asn Ser Gly Tyr Gly Gly Ala
100 105 110

Ala Pro Arg Val Ser Gly Ser Val Val Leu Asp Met Gly Lys Asn Met
115 120 125

Asn Arg Val Leu Glu Val Asn Val Glu Gly Ala Tyr Cys Val Val Glu
130 135 140

Pro Gly Val Thr Tyr His Asp Leu His Asn Tyr Leu Glu Ala Asn Asn
145 150 155 160

Leu Arg Asp Lys Leu Trp Leu Asp Val Pro Asp Leu Gly Gly Gly Ser
165 170 175

Val Leu Gly Asn Ala Val Glu Arg Gly Val Gly Tyr Thr Pro Tyr Gly
180 185 190

Asp His Trp Met Met His Ser Gly Met Glu Val Val Leu Ala Asn Gly
195 200 205

Glu Leu Leu Arg Thr Gly Met Gly Ala Leu Pro Asp Pro Lys Arg Pro
210 215 220

Glu Thr Met Gly Leu Lys Pro Glu Asp Gln Pro Trp Ser Lys Ile Ala
225 230 235 240

His Leu Phe Pro Tyr Gly Phe Gly Pro Tyr Ile Asp Gly Leu Phe Ser
245 250 255

Gln Ser Asn Met Gly Ile Val Thr Lys Ile Gly Ile Trp Leu Met Pro
260 265 270

Asn Pro Gly Gly Tyr Gln Ser Tyr Leu Ile Thr Leu Pro Lys Asp Gly
275 280 285

Asp Leu Lys Gln Ala Val Asp Ile Ile Arg Pro Leu Arg Leu Gly Met
290 295 300

Ala Leu Gln Asn Val Pro Thr Ile Arg His Ile Leu Leu Asp Ala Ala
305 310 315 320

Val Leu Gly Asp Lys Arg Ser Tyr Ser Ser Lys Thr Glu Pro Leu Ser
325 330 335

Asp Glu Glu Leu Asp Lys Ile Ala Lys Gln Leu Asn Leu Gly Arg Trp
340 345 350

Asn Phe Tyr Gly Ala Leu Tyr Gly Pro Glu Pro Ile Arg Arg Val Leu
355 360 365

Trp Glu Thr Ile Lys Asp Ala Phe Ser Ala Ile Pro Gly Val Lys Phe
370 375 380

Tyr Phe Pro Glu Asp Thr Pro Glu Asn Ser Val Leu Arg Val Arg Asp
385 390 395 400

Lys Thr Met Gln Gly Ile Pro Thr Tyr Asp Glu Leu Lys Trp Ile Asp
405 410 415

Trp Leu Pro Asn Gly Ala His Leu Phe Phe Ser Pro Ile Ala Lys Val
420 425 430

Ser Gly Glu Asp Ala Met Met Gln Tyr Ala Val Thr Lys Lys Arg Cys
435 440 445

Gln Glu Ala Gly Leu Asp Phe Ile Gly Thr Phe Thr Val Gly Met Arg
450 455 460

Glu Met His His Ile Val Cys Ile Val Phe Asn Lys Lys Asp Leu Ile
465 470 475 480

Gln Lys Arg Lys Val Gln Trp Leu Met Arg Thr Leu Ile Asp Asp Cys
485 490 495

Ala Ala Asn Gly Trp Gly Glu Tyr Arg Thr His Leu Ala Phe Met Asp
500 505 510

Gln Ile Met Glu Thr Tyr Asn Trp Asn Asn Ser Ser Phe Leu Arg Phe
515 520 525

Asn Glu Val Leu Lys Asn Ala Val Asp Pro Asn Gly Ile Ile Ala Pro
530 535 540

Gly Lys Ser Gly Val Trp Pro Ser Gln Tyr Ser His Val Thr Trp Lys
545 550 555 560

Leu



<210> 69 

<211> 852 

<212> DNA 

<213> Candida magnoliae 

<400> 69 



atggctaaga acttctccaa cgtcgagtac cccgccccgc ctccggccca caccaagaac 60

gagtcgctgc aggtccttga cctgttcaag ctgaatggca aggttgccag catcactggc 120

tcgtccagcg gtattggcta cgctctggct gaggccttcg cgcaggtcgg cgctgacgtc 180

gccatctggt acaacagcca cgacgctact ggcaaggctg aggccctcgc caagaagtac 240

ggcgtcaagg tcaaggccta caaggcgaac gtgagcagct ctgacgccgt gaagcagacg 300

atcgagcagc agatcaagga cttcggccac ctcgacattg tcgtggcgaa cgccggcatt 360

ccctggacga agggtgccta catcgaccag gacgacgaca agcacttcga ccaggtcgtt 420

gacgtcgatc tgaagggtgt tggatacgtc gcgaagcacg ctggccgtca cttccgcgag 480

cgcttcgaga aggagggcaa gaagggcgcc cttgtgttca cggcctccat gtctggccac 540

attgtgaacg tgccccagtt ccaggccacg tacaacgcgg ccaaggctgg cgtgcgccac 600

ttcgcgaagt cgctggccgt cgagttcgcg ccgttcgcgc gcgtgaactc tgtgtcgccg 660

ggctacatca acacggagat ctcggacttc gtgccccagg agacgcagaa caagtggtgg 720

tcgctcgtgc cccttggccg cggcggagag acggccgagc tcgttggcgc ctacctgttc 780

cttgcatctg acgccggctc gtacgccact ggtacggaca tcattgttga cggtggctac 840

acgcttccct aa 852



<210> 70 
<211> 283 
<212> PRT 

<213> Candida magnoliae 
<400> 70 

Met Ala Lys Asn Phe Ser Asn Val Glu Tyr Pro Ala Pro Pro Pro Ala
1 5 10 15

His Thr Lys Asn Glu Ser Leu Gln Val Leu Asp Leu Phe Lys Leu Asn
20 25 30

Gly Lys Val Ala Ser Ile Thr Gly Ser Ser Ser Gly Ile Gly Tyr Ala
35 40 45

Leu Ala Glu Ala Phe Ala Gln Val Gly Ala Asp Val Ala Ile Trp Tyr
50 55 60

Asn Ser His Asp Ala Thr Gly Lys Ala Glu Ala Leu Ala Lys Lys Tyr
65 70 75 80

Gly Val Lys Val Lys Ala Tyr Lys Ala Asn Val Ser Ser Ser Asp Ala
85 90 95

Val Lys Gln Thr Ile Glu Gln Gln Ile Lys Asp Phe Gly His Leu Asp
100 105 110

Ile Val Val Ala Asn Ala Gly Ile Pro Trp Thr Lys Gly Ala Tyr Ile
115 120 125

Asp Gln Asp Asp Asp Lys His Phe Asp Gln Val Val Asp Val Asp Leu
130 135 140

Lys Gly Val Gly Tyr Val Ala Lys His Ala Gly Arg His Phe Arg Glu
145 150 155 160

Arg Phe Glu Lys Glu Gly Lys Lys Gly Ala Leu Val Phe Thr Ala Ser
165 170 175

Met Ser Gly His Ile Val Asn Val Pro Gln Phe Gln Ala Thr Tyr Asn
180 185 190

Ala Ala Lys Ala Gly Val Arg His Phe Ala Lys Ser Leu Ala Val Glu
195 200 205

Phe Ala Pro Phe Ala Arg Val Asn Ser Val Ser Pro Gly Tyr Ile Asn
210 215 220

Thr Glu Ile Ser Asp Phe Val Pro Gln Glu Thr Gln Asn Lys Trp Trp
225 230 235 240

Ser Leu Val Pro Leu Gly Arg Gly Gly Glu Thr Ala Glu Leu Val Gly
245 250 255

Ala Tyr Leu Phe Leu Ala Ser Asp Ala Gly Ser Tyr Ala Thr Gly Thr
260 265 270

Asp Ile Ile Val Asp Gly Gly Tyr Thr Leu Pro
275 280

<210> 71 
<211> 852 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic gene derived from Candida magnoliae NADPH-dependent carbonyl reductase, having numerous codons replaced with others encoding the same amino acids to reduce the free energy of folding



<400> 71 
atggctaaaa acttctctaa cgttgaatac ccggctccgc cgccagctca caccaaaaac 60

gaatctctgc aggttctgga cctgttcaaa ctgaacggta aggttgcttc tatcaccggt 120

tcttcttctg gtatcggtta cgctctggct gaagcattcg ctcaggtagg tgctgacgtt 180

gctatctggt acaactctca cgacgctact ggtaaggctg aagctctggc taaaaaatac 240

ggtgttaaag ttaaagctta caaggctaac gtttcttctt ctgacgctgt aaaacagacc 300


atcgaacagc agatcaaaga [40 bases illegible] 360


ccgtggacca aaggtgctta catcgaccag gacgacgata aacacttcga tcaggttgtt 420

gacgttgatc tgaaaggtgt tggttatgtt gctaaacacg ctggccgtca cttccgtgag 480

cgtttcgaaa aggaaggtaa gaaaggcgct ctggttttca ccgcttctat gtctggtcac 540

atcgttaacg taccgcagtt tcaggctacc tacaacgctg ctaaagctgg tgttcgtcac 600

ttcgctaaat ctctggctgt agaattcgct ccgttcgctc gtgttaactc tgtttctccg 660


ggctacatca acaccgaaat ctctgacttt [30 bases illegible] 720


tctctggtac cgctgggccg tggtggcgaa actgctgaac tggttggtgc ttacctgttt 780

ctggcttctg acgctggttc ttacgctacc ggcactgaca tcatcgttga cggtggttac 840

accctgccgt aa 852


<210> 72 
<211> 1602 
<212> DNA 

<213> Saccharomyces cerevisiae 










<400> 72 
atgacagaag ataatattgc tccaatcacc tccgttaaag tagttaccga caagtgcacg 60

tacaaggaca acgagctgct caccaagtac agctacgaaa atgctgtagt tacgaagaca 120

gctagtggcc gcttcgatgt aacgcccact gttcaagact acgtgttcaa acttgacttg 180

aaaaagccgg aaaaactagg aattatgctc attgggttag gtggcaacaa tggctccact 240

ttagtggcct cggtattggc gaataagcac aatgtggagt ttcaaactaa ggaaggcgtt 300

aagcaaccaa actacttcgg ctccatgact caatgttcta ccttgaaact gggtatcgat 360

gcggagggga atgacgttta tgctcctttt aactctctgt tgcccatggt tagcccaaac 420

gactttgtcg tctctggttg ggacatcaat aacgcagatc tatacgaagc tatgcagaga 480

agtcaagttc tcgaatatga tctgcaacaa cgcttgaagg cgaagatgtc cttggtgaag 540

cctcttcctt ccatttacta ccctgatttc attgcagcta atcaagatga gagagccaat 600

aactgcatca atttggatga aaaaggcaac gtaaccacga ggggtaagtg gacccatctg 660

caacgcatca gacgcgatat ccagaatttc aaagaagaaa acgcccttga taaagtaatc 720

gttctttgga ctgcaaatac tgagaggtac gtagaagtat ctcctggtgt taatgacacc 780

atggaaaacc tcttgcagtc tattaagaat gaccatgaag agattgctcc ttccacgatc 840

tttgcagcag catctatctt ggaaggtgtc ccctatatta atggttcacc gcagaatact 900

tttgttcccg gcttggttca gctggctgag catgagggta cattcattgc gggagacgat 960

ctcaagtcgg gacaaaccaa gttgaagtct gttctggccc agttcttagt ggatgcaggt 1020

attaaaccgg tctccattgc atcctataac catttaggca ataatgacgg ttataactta 1080

tctgctccaa aacaatttag gtctaaggag atttccaaaa gttctgtcat agatgacatc 1140

atcgcgtcta atgatatctt gtacaatgat aaactgggta aaaaagttga ccactgcatt 1200

gtcatcaaat atatgaagcc cgtcggggac tcaaaagtgg caatggacga gtattacagt 1260

gagttgatgt taggtggcca taaccggatt tccattcaca atgtttgcga agattcttta 1320

ctggctacgc ccttgatcat cgatctttta gtcatgactg agttttgtac aagagtgtcc 1380

tataaaaaaa tggacccagt taaagaagat gctggcaaat tcgagaactt ttatccagtt 1440

ttaaccttct tgagttactg gttaaaagct ccattaacaa gaccaggatt tcacccggtg 1500

aatggcttaa acaagcaaag aaccgcctta gaaaattttt taagattgtt gattggattg 1560

ccttctcaaa acgaactaag attcgaagag agattgttgt aa 1602



<210> 73 
<211> 533 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 73 

Met Thr Glu Asp Asn Ile Ala Pro Ile Thr Ser Val Lys Val Val Thr
1 5 10 15

Asp Lys Cys Thr Tyr Lys Asp Asn Glu Leu Leu Thr Lys Tyr Ser Tyr
20 25 30

Glu Asn Ala Val Val Thr Lys Thr Ala Ser Gly Arg Phe Asp Val Thr
35 40 45

Pro Thr Val Gln Asp Tyr Val Phe Lys Leu Asp Leu Lys Lys Pro Glu
50 55 60

Lys Leu Gly Ile Met Leu Ile Gly Leu Gly Gly Asn Asn Gly Ser Thr
65 70 75 80

Leu Val Ala Ser Val Leu Ala Asn Lys His Asn Val Glu Phe Gln Thr
85 90 95

Lys Glu Gly Val Lys Gln Pro Asn Tyr Phe Gly Ser Met Thr Gln Cys
100 105 110

Ser Thr Leu Lys Leu Gly Ile Asp Ala Glu Gly Asn Asp Val Tyr Ala
115 120 125

Pro Phe Asn Ser Leu Leu Pro Met Val Ser Pro Asn Asp Phe Val Val
130 135 140

Ser Gly Trp Asp Ile Asn Asn Ala Asp Leu Tyr Glu Ala Met Gln Arg
145 150 155 160

Ser Gln Val Leu Glu Tyr Asp Leu Gln Gln Arg Leu Lys Ala Lys Met
165 170 175

Ser Leu Val Lys Pro Leu Pro Ser Ile Tyr Tyr Pro Asp Phe Ile Ala
180 185 190

Ala Asn Gln Asp Glu Arg Ala Asn Asn Cys Ile Asn Leu Asp Glu Lys
195 200 205

Gly Asn Val Thr Thr Arg Gly Lys Trp Thr His Leu Gln Arg Ile Arg
210 215 220

Arg Asp Ile Gln Asn Phe Lys Glu Glu Asn Ala Leu Asp Lys Val Ile
225 230 235 240

Val Leu Trp Thr Ala Asn Thr Glu Arg Tyr Val Glu Val Ser Pro Gly
245 250 255

Val Asn Asp Thr Met Glu Asn Leu Leu Gln Ser Ile Lys Asn Asp His
260 265 270

Glu Glu Ile Ala Pro Ser Thr Ile Phe Ala Ala Ala Ser Ile Leu Glu
275 280 285

Gly Val Pro Tyr Ile Asn Gly Ser Pro Gln Asn Thr Phe Val Pro Gly
290 295 300

Leu Val Gln Leu Ala Glu His Glu Gly Thr Phe Ile Ala Gly Asp Asp
305 310 315 320

Leu Lys Ser Gly Gln Thr Lys Leu Lys Ser Val Leu Ala Gln Phe Leu
325 330 335

Val Asp Ala Gly Ile Lys Pro Val Ser Ile Ala Ser Tyr Asn His Leu
340 345 350

Gly Asn Asn Asp Gly Tyr Asn Leu Ser Ala Pro Lys Gln Phe Arg Ser
355 360 365

Lys Glu Ile Ser Lys Ser Ser Val Ile Asp Asp Ile Ile Ala Ser Asn
370 375 380

Asp Ile Leu Tyr Asn Asp Lys Leu Gly Lys Lys Val Asp His Cys Ile
385 390 395 400

Val Ile Lys Tyr Met Lys Pro Val Gly Asp Ser Lys Val Ala Met Asp
405 410 415

Glu Tyr Tyr Ser Glu Leu Met Leu Gly Gly His Asn Arg Ile Ser Ile
420 425 430

His Asn Val Cys Glu Asp Ser Leu Leu Ala Thr Pro Leu Ile Ile Asp
435 440 445

Leu Leu Val Met Thr Glu Phe Cys Thr Arg Val Ser Tyr Lys Lys Val
450 455 460

Asp Pro Val Lys Glu Asp Ala Gly Lys Phe Glu Asn Phe Tyr Pro Val
465 470 475 480

Leu Thr Phe Leu Ser Tyr Trp Leu Lys Ala Pro Leu Thr Arg Pro Gly
485 490 495

Phe His Pro Val Asn Gly Leu Asn Lys Gln Arg Thr Ala Leu Glu Asn
500 505 510

Phe Leu Arg Leu Leu Ile Gly Leu Pro Ser Gln Asn Glu Leu Arg Phe
515 520 525

Glu Glu Arg Leu Leu
530



<210> 74 
<211> 1605 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic gene derived from Saccharomyces cerevisiae myo-inositol-1-phosphate synthase, having numerous codons replaced with others encoding the same amino acids to reduce free energy of folding, and a gly codon inserted after the initiating met codon



<400> 74 
atgggtaccg aagataacat cgctccaatc acttctgtta aagttgtaac tgacaaatgt 60

acttacaaag acaacgaact gctgactaaa tactcttacg aaaacgctgt agtaactaaa 120

actgcttctg gtcgtttcga tgttactccg actgttcagg actacgtatt caaactggat 180

ctgaagaaac cggaaaagct gggtatcatg ctgatcggcc tgggtggtaa caacggctct 240

actctggttg catctgttct ggcaaacaaa cacaacgtag aattccagac taaggaaggt 300

gttaaacagc cgaactactt tggttctatg actcagtgtt ctactctgaa gctgggcatt 360

gatgctgaag gtaacgacgt ttacgctccg ttcaactctc tgctgccgat ggtatctccg 420

aacgacttcg ttgtttctgg ttgggatatc aacaacgcgg atctgtacga agcaatgcag 480

cgttctcagg ttctggaata tgatctgcaa cagcgtctga aggctaagat gtctctggtt 540

aagccactgc cgtccatcta ctacccggat tttatcgcag ctaaccagga cgaacgtgct 600

aacaactgta tcaacctgga cgaaaagggt aacgttacta cccgtggtaa gtggactcac 660

ctgcagcgta tccgtcgtga tatccagaac ttcaaagagg aaaacgcact ggacaaagtt 720

atcgtactgt ggactgctaa cactgaacgt tacgtagaag tatccccggg tgtaaacgat 780

actatggaaa acctgctgca atctatcaag aacgaccacg aggaaatcgc tccgtccacc 840

atcttcgctg ctgcatctat cctggaaggc gtaccgtaca tcaacggctc tccgcagaac 900

actttcgtac cgggtctggt acagctggct gaacacgaag gtaccttcat cgctggtgac 960

gatctgaaat ctggccagac taaactgaaa tctgtactgg cacagttcct ggttgacgct 1020

ggtatcaaac cggtttctat cgcttcttat aaccacctgg gtaacaacga cggctacaac 1080

ctgtctgctc cgaaacagtt ccgttctaaa gaaatctcta aatcctctgt aatcgacgac 1140

atcatcgctt ctaacgacat cctgtacaac gacaaactgg gtaagaaagt agatcactgt 1200

atcgttatca aatacatgaa accggttggt gattctaaag ttgctatgga cgaatactac 1260

tctgaactga tgctgggcgg tcacaaccgt atctctatcc acaacgtttg tgaagactct 1320

ctgctggcta ccccgctgat catcgacctg ctggttatga ctgaattctg tacccgtgta 1380

tcttacaaga aagttgaccc ggttaaagaa gatgctggca aattcgaaaa cttctacccg 1440

gttctgacct tcctgtctta ctggctgaaa gctccgctga ctcgtccagg cttccacccg 1500

gttaacggtc tgaacaaaca gcgtaccgct ctggaaaact tcctgcgtct gctgatcggc 1560

ctgccgtccc agaacgaact gcgtttcgaa gaacgtctgc tgtaa 1605



<210> 75 
<211> 534 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic protein derived from Saccharomyces cerevisiae myo-inositol-1-phosphate synthase, having a glycine residue inserted after the initiating methionine

<400> 75 

Met Gly Thr Glu Asp Asn Ile Ala Pro Ile Thr Ser Val Lys Val Val
1 5 10 15

Thr Asp Lys Cys Thr Tyr Lys Asp Asn Glu Leu Leu Thr Lys Tyr Ser
20 25 30

Tyr Glu Asn Ala Val Val Thr Lys Thr Ala Ser Gly Arg Phe Asp Val
35 40 45

Thr Pro Thr Val Gln Asp Tyr Val Phe Lys Leu Asp Leu Lys Lys Pro
50 55 60

Glu Lys Leu Gly Ile Met Leu Ile Gly Leu Gly Gly Asn Asn Gly Ser
65 70 75 80

Thr Leu Val Ala Ser Val Leu Ala Asn Lys His Asn Val Glu Phe Gln
85 90 95

Thr Lys Glu Gly Val Lys Gln Pro Asn Tyr Phe Gly Ser Met Thr Gln
100 105 110

Cys Ser Thr Leu Lys Leu Gly Ile Asp Ala Glu Gly Asn Asp Val Tyr
115 120 125

Ala Pro Phe Asn Ser Leu Leu Pro Met Val Ser Pro Asn Asp Phe Val
130 135 140

Val Ser Gly Trp Asp Ile Asn Asn Ala Asp Leu Tyr Glu Ala Met Gln
145 150 155 160

Arg Ser Gln Val Leu Glu Tyr Asp Leu Gln Gln Arg Leu Lys Ala Lys
165 170 175

Met Ser Leu Val Lys Pro Leu Pro Ser Ile Tyr Tyr Pro Asp Phe Ile
180 185 190

Ala Ala Asn Gln Asp Glu Arg Ala Asn Asn Cys Ile Asn Leu Asp Glu
195 200 205

Lys Gly Asn Val Thr Thr Arg Gly Lys Trp Thr His Leu Gln Arg Ile
210 215 220

Arg Arg Asp Ile Gln Asn Phe Lys Glu Glu Asn Ala Leu Asp Lys Val
225 230 235 240

Ile Val Leu Trp Thr Ala Asn Thr Glu Arg Tyr Val Glu Val Ser Pro
245 250 255

Gly Val Asn Asp Thr Met Glu Asn Leu Leu Gln Ser Ile Lys Asn Asp
260 265 270

His Glu Glu Ile Ala Pro Ser Thr Ile Phe Ala Ala Ala Ser Ile Leu
275 280 285

Glu Gly Val Pro Tyr Ile Asn Gly Ser Pro Gln Asn Thr Phe Val Pro
290 295 300

Gly Leu Val Gln Leu Ala Glu His Glu Gly Thr Phe Ile Ala Gly Asp
305 310 315 320

Asp Leu Lys Ser Gly Gln Thr Lys Leu Lys Ser Val Leu Ala Gln Phe
325 330 335

Leu Val Asp Ala Gly Ile Lys Pro Val Ser Ile Ala Ser Tyr Asn His
340 345 350

Leu Gly Asn Asn Asp Gly Tyr Asn Leu Ser Ala Pro Lys Gln Phe Arg
355 360 365

Ser Lys Glu Ile Ser Lys Ser Ser Val Ile Asp Asp Ile Ile Ala Ser
370 375 380

Asn Asp Ile Leu Tyr Asn Asp Lys Leu Gly Lys Lys Val Asp His Cys
385 390 395 400

Ile Val Ile Lys Tyr Met Lys Pro Val Gly Asp Ser Lys Val Ala Met
405 410 415

Asp Glu Tyr Tyr Ser Glu Leu Met Leu Gly Gly His Asn Arg Ile Ser
420 425 430

Ile His Asn Val Cys Glu Asp Ser Leu Leu Ala Thr Pro Leu Ile Ile
435 440 445

Asp Leu Leu Val Met Thr Glu Phe Cys Thr Arg Val Ser Tyr Lys Lys
450 455 460

Val Asp Pro Val Lys Glu Asp Ala Gly Lys Phe Glu Asn Phe Tyr Pro
465 470 475 480

Val Leu Thr Phe Leu Ser Tyr Trp Leu Lys Ala Pro Leu Thr Arg Pro
485 490 495

Gly Phe His Pro Val Asn Gly Leu Asn Lys Gln Arg Thr Ala Leu Glu
500 505 510

Asn Phe Leu Arg Leu Leu Ile Gly Leu Pro Ser Gln Asn Glu Leu Arg
515 520 525

Phe Glu Glu Arg Leu Leu
530

<210> 76 

<211> 2043 

<212> DNA 

<213> Hypomyces rosellus 



<400> 76 



atgaaacacc ttttaacact cgctctttgc ttcagcagca tcaatgctgt tgctgtcacc 60

gtccctcaca aggccgtagg aactggaatt cctgaaggga gtcttcagtt cctgagcctt 120

cgagcctcag cacctatcgg aagcgccatt tctcgcaaca actgggccgt cacttgcgac 180

agtgcacagt cgggaaatga atgcaacaag gccattgatg gcaacaagga taccttttgg 240

cacacattct atggcgccaa cggggatcca aagccccctc acacatacac gattgacatg 300

aagacaactc agaacgtcaa cggcttgtct atgctgcctc gacaggatgg taaccaaaac 360

ggctggatcg gtcgccatga ggtttatcta agctcagatg gcacaaactg gggcagccct 420

gttgcgtcag gtagttggtt cgccgactct actacaaaat actccaactt tgaaactcgc 480

cctgctcgct atgttcgtct tgtcgctatc actgaagcga atggccagcc ttggactagc 540

attgcagaga tcaacgtctt ccaagctagt tcttacacag ccccccagcc tggtcttgga 600

cgctggggtc cgactattga cttaccgatt gttcctgcgg ctgcagcaat tgaaccgaca 660


[20 bases illegible] gtcttcatat cgcaatgatg catttggagg atcccctggt 720


ggtatcactt tgacgtcttc ctgggatcca tccactggta ttgtttccga ccgcactgtg 780

acagtcacca agcatgatat gttctgccct ggtatctcca tggatggtaa cggtcagatc 840

gtagtcacag gtggcaacga tgccaagaag accagtttgt atgattcatc tagcgatagc 900

tggatcccgg gacctgacat gcaagtggct cgtgggtatc agtcatcagc taccatgtca 960

gacggtcgtg tttttaccat tggaggctcc tggagcggtg gcgtatttga gaagaatggc 1020

gaagtctata gcccatcttc aaagacatgg acgtccctac ccaatgccaa ggtcaaccca 1080

atgttgacgg ctgacaagca aggattgtac cgttcagaca accacgcgtg gctctttgga 1140

tggaagaagg gttcggtgtt ccaagcggga cctagcacag ccatgaactg gtactatacc 1200

agtggaagtg gtgatgtgaa gtcagccgga aaacgccagt ctaaccgtgg tgtagcccct 1260

gatgccatgt gcggaaacgc tgtcatgtac gacgccgtta aaggaaagat cctgaccttt 1320



ggcggctccc cagattatca agactctgac gccacaacca acgcccacat catcaccctc 1380 

ggtgaacccg gaacatctcc caacactgtc tttgctagca atgggttgta ctttgcccga 1440 

acgtttcaca cctctgttgt tcttccagac ggaagcacgt ttattacagg aggccaacga 1500 

cgtggaattc cgttcgagga ttcaaccccg gtatttacac ctgagatcta cgtccctgaa 1560 

caagacactt tctacaagca gaaccccaac tccattgttc gcgtctacca tagcatttcc 1620 

cttttgttac ctgatggcag ggtatttaac ggtggtggtg gtctttgtgg cgattgtacc 1680 

acgaatcatt tcgacgcgca aatctttacg ccaaactatc tttacaatag caacggcaat 1740 

ctcgcgacac gtcccaagat taccagaacc tctacacaga gcgtcaaggt cggtggcaga 1800 

attacaatct cgacggattc ttcgattagc aaggcgtcgt tgattcgcta tggtacagcg 1860 

acacacacgg ttaatactga ccagcgccgc attcccctga ctctgacaaa caatggagga 1920 

aatagctatt ctttccaagt tcctagcgac tctggtgttg ctttgcctgg ctactggatg 1980 

ttgttcgtga tgaactcggc cggtgttcct agtgtggctt cgacgattcg cgttactcag 2040 

tga 2043 



<210> 77 

<211> 680 

<212> PRT 

<213> Hypomyces rosellus 

<400> 77 

Met Lys His Leu Leu Thr Leu Ala Leu Cys Phe Ser Ser Ile Asn Ala
1 5 10 15

Val Ala Val Thr Val Pro His Lys Ala Val Gly Thr Gly Ile Pro Glu
20 25 30

Gly Ser Leu Gln Phe Leu Ser Leu Arg Ala Ser Ala Pro Ile Gly Ser
35 40 45

Ala Ile Ser Arg Asn Asn Trp Ala Val Thr Cys Asp Ser Ala Gln Ser
50 55 60

Gly Asn Glu Cys Asn Lys Ala Ile Asp Gly Asn Lys Asp Thr Phe Trp
65 70 75 80

His Thr Phe Tyr Gly Ala Asn Gly Asp Pro Lys Pro Pro His Thr Tyr
85 90 95

Thr Ile Asp Met Lys Thr Thr Gln Asn Val Asn Gly Leu Ser Met Leu
100 105 110

Pro Arg Gln Asp Gly Asn Gln Asn Gly Trp Ile Gly Arg His Glu Val
115 120 125

Tyr Leu Ser Ser Asp Gly Thr Asn Trp Gly Ser Pro Val Ala Ser Gly
130 135 140

Ser Trp Phe Ala Asp Ser Thr Thr Lys Tyr Ser Asn Phe Glu Thr Arg
145 150 155 160

Pro Ala Arg Tyr Val Arg Leu Val Ala Ile Thr Glu Ala Asn Gly Gln
165 170 175

Pro Trp Thr Ser Ile Ala Glu Ile Asn Val Phe Gln Ala Ser Ser Tyr
180 185 190

Thr Ala Pro Gln Pro Gly Leu Gly Arg Trp Gly Pro Thr Ile Asp Leu
195 200 205

Pro Ile Val Pro Ala Ala Ala Ala Ile Glu Pro Thr Ser Gly Arg Val
210 215 220

Leu Met Trp Ser Ser Tyr Arg Asn Asp Ala Phe Gly Gly Ser Pro Gly
225 230 235 240

Gly Ile Thr Leu Thr Ser Ser Trp Asp Pro Ser Thr Gly Ile Val Ser
245 250 255

Asp Arg Thr Val Thr Val Thr Lys His Asp Met Phe Cys Pro Gly Ile
260 265 270

Ser Met Asp Gly Asn Gly Gln Ile Val Val Thr Gly Gly Asn Asp Ala
275 280 285

Lys Lys Thr Ser Leu Tyr Asp Ser Ser Ser Asp Ser Trp Ile Pro Gly
290 295 300

Pro Asp Met Gln Val Ala Arg Gly Tyr Gln Ser Ser Ala Thr Met Ser
305 310 315 320

Asp Gly Arg Val Phe Thr Ile Gly Gly Ser Trp Ser Gly Gly Val Phe
325 330 335

Glu Lys Asn Gly Glu Val Tyr Ser Pro Ser Ser Lys Thr Trp Thr Ser
340 345 350

Leu Pro Asn Ala Lys Val Asn Pro Met Leu Thr Ala Asp Lys Gln Gly
355 360 365

Leu Tyr Arg Ser Asp Asn His Ala Trp Leu Phe Gly Trp Lys Lys Gly
370 375 380

Ser Val Phe Gln Ala Gly Pro Ser Thr Ala Met Asn Trp Tyr Tyr Thr
385 390 395 400

Ser Gly Ser Gly Asp Val Lys Ser Ala Gly Lys Arg Gln Ser Asn Arg
405 410 415

Gly Val Ala Pro Asp Ala Met Cys Gly Asn Ala Val Met Tyr Asp Ala
420 425 430

Val Lys Gly Lys Ile Leu Thr Phe Gly Gly Ser Pro Asp Tyr Gln Asp
435 440 445

Ser Asp Ala Thr Thr Asn Ala His Ile Ile Thr Leu Gly Glu Pro Gly
450 455 460

Thr Ser Pro Asn Thr Val Phe Ala Ser Asn Gly Leu Tyr Phe Ala Arg
465 470 475 480

Thr Phe His Thr Ser Val Val Leu Pro Asp Gly Ser Thr Phe Ile Thr
485 490 495

Gly Gly Gln Arg Arg Gly Ile Pro Phe Glu Asp Ser Thr Pro Val Phe
500 505 510

Thr Pro Glu Ile Tyr Val Pro Glu Gln Asp Thr Phe Tyr Lys Gln Asn
515 520 525

Pro Asn Ser Ile Val Arg Val Tyr His Ser Ile Ser Leu Leu Leu Pro
530 535 540

Asp Gly Arg Val Phe Asn Gly Gly Gly Gly Leu Cys Gly Asp Cys Thr
545 550 555 560

Thr Asn His Phe Asp Ala Gln Ile Phe Thr Pro Asn Tyr Leu Tyr Asn
565 570 575

Ser Asn Gly Asn Leu Ala Thr Arg Pro Lys Ile Thr Arg Thr Ser Thr
580 585 590

Gln Ser Val Lys Val Gly Gly Arg Ile Thr Ile Ser Thr Asp Ser Ser
595 600 605

Ile Ser Lys Ala Ser Leu Ile Arg Tyr Gly Thr Ala Thr His Thr Val
610 615 620

Asn Thr Asp Gln Arg Arg Ile Pro Leu Thr Leu Thr Asn Asn Gly Gly
625 630 635 640

Asn Ser Tyr Ser Phe Gln Val Pro Ser Asp Ser Gly Val Ala Leu Pro
645 650 655

Gly Tyr Trp Met Leu Phe Val Met Asn Ser Ala Gly Val Pro Ser Val
660 665 670

Ala Ser Thr Ile Arg Val Thr Gln
675 680



<210> 78 
<211> 2046 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic gene derived from Hypomyces rosellus galactose oxidase, having numerous codons replaced with others encoding the same amino acids to reduce free energy of folding and a gly codon inserted after the initiating met codon to insert a restriction site

<400> 78 

atgggcaagc atctgctgac tctggcactg tgtttctctt ctatcaacgc tgttgctgta 60 

accgttccgc ataaggctgt tggtaccggt atcccggaag gttctctgca gttcctgtct 120 

ctgcgtgctt ctgctccgat cggttctgct atctctcgta acaactgggc agttacctgc 180 

gactccgcac agtctggtaa cgaatgcaac aaagctatcg acggtaacaa agacactttt 240

tggcacactt tctatggcgc taacggcgac ccgaaaccgc cgcacaccta caccatcgat 300 



atgaaaacca ctcagaacgt aaacggcctg tctatgctgc cgcgccagga tggtaaccag 360

aacggttgga ttggtcgtca tgaggtatat ctgtcttccg atggtactaa ctggggttct 420

ccggtagctt ctggctcctg gttcgctgac tctaccacca aatactctaa cttcgagact 480

cgtccggcac gctatgtacg cctggttgct attactgagg caaacggtca gccgtggacc 540

tctatcgcag aaattaacgt tttccaggca tcttcttaca ccgctccgca gccgggtctg 600

ggtcgctggg gtccgactat tgacctgccg atcgttccgg cagctgctgc tattgagccg 660

acttctggtc gtgttctgat gtggtcttct taccgtaacg acgctttcgg tggttctccg 720

ggcggcatca ccctgacctc ttcttgggat ccgtctactg gcatcgtttc cgatcgtacc 780

gtaactgtta ctaagcacga tatgttttgt ccgggtattt ctatggatgg caacggccag 840

attgtagtaa ctggtggcaa cgacgctaaa aaaacctctc tgtatgattc ctcctctgat 900

tcttggatcc cgggtccgga catgcaggta gctcgcggtt atcagtcttc cgctactatg 960

tctgatggcc gtgttttcac tattggtggt tcttggtctg gcggcgtatt tgagaaaaac 1020

ggtgaagttt actctccatc ctccaaaact tggacttccc tgccgaacgc taaagttaac 1080

ccgatgctga ctgcagataa gcagggcctg taccgttccg ataaccacgc atggctgttt 1140

ggctggaaaa aaggctccgt atttcaggct ggtccgtcta ctgctatgaa ctggtactat 1200

acttctggtt ctggcgatgt taagtccgct ggcaagcgtc agtctaaccg tggcgtagca 1260

ccggatgcta tgtgcggtaa cgctgttatg tacgatgctg taaagggcaa gattctgact 1320

tttggtggct ctccggacta tcaggactcc gacgctacta ctaacgcaca tatcattact 1380

ctgggtgagc cgggtacctc tccgaacact gtatttgcat ctaacggcct gtactttgct 1440

cgtacctttc acacctctgt agtactgccg gatggttcca cttttatcac tggcggtcag 1500

cgccgcggta ttccgttcga ggactctact ccggttttca ccccggagat ctacgtaccg 1560

gagcaggata ctttctacaa gcagaacccg aactccattg ttcgtgtata ccactctatc 1620

tctctgctgc tgccggatgg tcgtgtattt aacggtggtg gtggtctgtg tggcgactgt 1680

actactaacc atttcgatgc gcagattttt accccgaact atctgtataa ctctaacggt 1740

aacctggcaa ctcgcccgaa aattactcgc acttctaccc agtctgtaaa ggtaggcggc 1800

cgtatcacca tctctaccga ctcttctatc tctaaagctt ctctgattcg ctatggtacc 1860

gctacccata ctgtaaacac tgaccagcgt cgtatcccgc tgaccctgac caacaacggt 1920

ggtaactctt actcttttca ggttccgtct gactctggtg ttgctctgcc gggttactgg 1980




atgctgttcg ttatgaactc tgctggtgtt ccgtctgttg cttctaccat ccgtgttacc 2040

cagtag 2046

<210> 79

<211> 681

<212> PRT

<213> Artificial Sequence
<220>

<223> Synthetic protein derived from Hypomyces rosellus galactose oxidase, having a glycine residue inserted after the initiating methionine

<400> 79



Met Gly Lys His Leu Leu Thr Leu Ala Leu Cys Phe Ser Ser Ile Asn
1 5 10 15

Ala Val Ala Val Thr Val Pro His Lys Ala Val Gly Thr Gly Ile Pro
20 25 30

Glu Gly Ser Leu Gln Phe Leu Ser Leu Arg Ala Ser Ala Pro Ile Gly
35 40 45

Ser Ala Ile Ser Arg Asn Asn Trp Ala Val Thr Cys Asp Ser Ala Gln
50 55 60

Ser Gly Asn Glu Cys Asn Lys Ala Ile Asp Gly Asn Lys Asp Thr Phe
65 70 75 80

Trp His Thr Phe Tyr Gly Ala Asn Gly Asp Pro Lys Pro Pro His Thr
85 90 95

Tyr Thr Ile Asp Met Lys Thr Thr Gln Asn Val Asn Gly Leu Ser Met
100 105 110

Leu Pro Arg Gln Asp Gly Asn Gln Asn Gly Trp Ile Gly Arg His Glu
115 120 125

Val Tyr Leu Ser Ser Asp Gly Thr Asn Trp Gly Ser Pro Val Ala Ser
130 135 140

Gly Ser Trp Phe Ala Asp Ser Thr Thr Lys Tyr Ser Asn Phe Glu Thr
145 150 155 160

Arg Pro Ala Arg Tyr Val Arg Leu Val Ala Ile Thr Glu Ala Asn Gly
165 170 175

Gln Pro Trp Thr Ser Ile Ala Glu Ile Asn Val Phe Gln Ala Ser Ser
180 185 190

Tyr Thr Ala Pro Gln Pro Gly Leu Gly Arg Trp Gly Pro Thr Ile Asp
195 200 205

Leu Pro Ile Val Pro Ala Ala Ala Ala Ile Glu Pro Thr Ser Gly Arg
210 215 220

Val Leu Met Trp Ser Ser Tyr Arg Asn Asp Ala Phe Gly Gly Ser Pro
225 230 235 240

Gly Gly Ile Thr Leu Thr Ser Ser Trp Asp Pro Ser Thr Gly Ile Val
245 250 255

Ser Asp Arg Thr Val Thr Val Thr Lys His Asp Met Phe Cys Pro Gly
260 265 270

Ile Ser Met Asp Gly Asn Gly Gln Ile Val Val Thr Gly Gly Asn Asp
275 280 285

Ala Lys Lys Thr Ser Leu Tyr Asp Ser Ser Ser Asp Ser Trp Ile Pro
290 295 300

Gly Pro Asp Met Gln Val Ala Arg Gly Tyr Gln Ser Ser Ala Thr Met
305 310 315 320

Ser Asp Gly Arg Val Phe Thr Ile Gly Gly Ser Trp Ser Gly Gly Val
325 330 335

Phe Glu Lys Asn Gly Glu Val Tyr Ser Pro Ser Ser Lys Thr Trp Thr
340 345 350

Ser Leu Pro Asn Ala Lys Val Asn Pro Met Leu Thr Ala Asp Lys Gln
355 360 365

Gly Leu Tyr Arg Ser Asp Asn His Ala Trp Leu Phe Gly Trp Lys Lys
370 375 380

Gly Ser Val Phe Gln Ala Gly Pro Ser Thr Ala Met Asn Trp Tyr Tyr
385 390 395 400

Thr Ser Gly Ser Gly Asp Val Lys Ser Ala Gly Lys Arg Gln Ser Asn
405 410 415

Arg Gly Val Ala Pro Asp Ala Met Cys Gly Asn Ala Val Met Tyr Asp
420 425 430

Ala Val Lys Gly Lys Ile Leu Thr Phe Gly Gly Ser Pro Asp Tyr Gln
435 440 445

Asp Ser Asp Ala Thr Thr Asn Ala His Ile Ile Thr Leu Gly Glu Pro
450 455 460

Gly Thr Ser Pro Asn Thr Val Phe Ala Ser Asn Gly Leu Tyr Phe Ala
465 470 475 480

Arg Thr Phe His Thr Ser Val Val Leu Pro Asp Gly Ser Thr Phe Ile
485 490 495

Thr Gly Gly Gln Arg Arg Gly Ile Pro Phe Glu Asp Ser Thr Pro Val
500 505 510

Phe Thr Pro Glu Ile Tyr Val Pro Glu Gln Asp Thr Phe Tyr Lys Gln
515 520 525

Asn Pro Asn Ser Ile Val Arg Val Tyr His Ser Ile Ser Leu Leu Leu
530 535 540

Pro Asp Gly Arg Val Phe Asn Gly Gly Gly Gly Leu Cys Gly Asp Cys
545 550 555 560

Thr Thr Asn His Phe Asp Ala Gln Ile Phe Thr Pro Asn Tyr Leu Tyr
565 570 575

Asn Ser Asn Gly Asn Leu Ala Thr Arg Pro Lys Ile Thr Arg Thr Ser
580 585 590

Thr Gln Ser Val Lys Val Gly Gly Arg Ile Thr Ile Ser Thr Asp Ser
595 600 605

Ser Ile Ser Lys Ala Ser Leu Ile Arg Tyr Gly Thr Ala Thr His Thr
610 615 620

Val Asn Thr Asp Gln Arg Arg Ile Pro Leu Thr Leu Thr Asn Asn Gly
625 630 635 640

Gly Asn Ser Tyr Ser Phe Gln Val Pro Ser Asp Ser Gly Val Ala Leu
645 650 655

Pro Gly Tyr Trp Met Leu Phe Val Met Asn Ser Ala Gly Val Pro Ser
660 665 670

Val Ala Ser Thr Ile Arg Val Thr Gln
675 680



